Abstract

Motivated by the fact that in many game-theoretic settings, the game analyzed is only an approxi- 
mation to the game being played, in this work we analyze equilibrium computation for the broad and 
natural class of bimatrix games that are stable to perturbations. We specifically focus on games with the 
property that small changes in the payoff matrices do not cause the Nash equilibria of the game to fluctu- 
ate wildly. For such games we show how one can compute approximate Nash equilibria more efficiently 
than the general result of Lipton et al. [18], by an amount that depends on the degree of stability of the
game and that reduces to their bound in the worst case. Furthermore, we show that for stable games 
the approximate equilibria found will be close in variation distance to true equilibria, and moreover this 
holds even if we are given as input only a perturbation of the actual underlying stable game. 

For uniformly-stable games, where the equilibria fluctuate at most quasi-linearly in the extent of the 
perturbation, we get a particularly dramatic improvement. Here, we achieve a fully quasi-polynomial-time
approximation scheme: that is, we can find 1/poly(n)-approximate equilibria in quasi-polynomial
time. This is in marked contrast to the general class of bimatrix games for which finding such approx- 
imate equilibria is PPAD-hard. In particular, under the (widely believed) assumption that PPAD is not 
contained in quasi-polynomial time, our results imply that such uniformly stable games are inherently 
easier for computation of approximate equilibria than general bimatrix games. 
1 Introduction

The Nash equilibrium solution concept has a long history in economics and game theory as a description for
the natural result of self-interested behavior [20, 22]. Its importance has led to significant effort in the
computer science literature in recent years towards understanding its computational structure, and in particular
on the complexity of finding both Nash and approximate Nash equilibria. A series of results culminating 
in the work by Daskalakis, Goldberg, and Papadimitriou [10] and Chen, Deng, and Teng [?, ?] showed
that finding a Nash equilibrium, or even a 1/poly(n)-approximate equilibrium, is PPAD-complete even for
2-player bimatrix games. For general values of ε, the best known algorithm for finding ε-approximate
equilibria runs in time $n^{O((\log n)/\epsilon^2)}$, based on a structural result of Lipton et al. [18] showing that there
always exist ε-approximate equilibria with support over at most $O((\log n)/\epsilon^2)$ strategies. This structural
result has been shown to be existentially tight [15]. Even for large values of ε, despite considerable
effort [?], polynomial-time algorithms for computing ε-approximate equilibria are known
only for ε ≥ 0.3393 [21]. These results suggest a difficult computational landscape for equilibrium and
approximate equilibrium computation on worst-case instances.

In this paper we go beyond worst-case analysis and investigate the equilibrium computation problem in a 
natural class of bimatrix games that are stable to perturbations. As we argue, on one hand, such games can be 
used to model many realistic situations. On the other hand, we show that they have additional structure which 
can be exploited to provide better algorithmic guarantees than what is believed to be possible on worst-case 
instances. The starting point of our work is the realization that games are typically only abstractions of 
reality and except in the most controlled settings, payoffs listed in a game that represents an interaction 
between self-interested agents are only approximations to the agents' exact utilities. As a result, for 
problems such as equilibrium computation, it is natural to focus attention on games that are robust to the
exact payoff values, in the sense that small changes to the entries in the game matrices do not cause the 
Nash equilibria to fluctuate wildly; otherwise, even if equilibria can be computed, they may not actually be 
meaningful for understanding behavior in the game that is played. In this work, we focus on such games 
and analyze their structural properties as well as their implications to the equilibrium computation problem. 
We show how their structure can be leveraged to obtain better algorithms for computing approximate Nash 
equilibria, as well as strategies close to true Nash equilibria. Furthermore, we provide such algorithmic 
guarantees even if we are given only a perturbation of the actual stable game being played. 

To formalize such settings we consider bimatrix games G that satisfy what we call the (ε, Δ) perturbation
stability condition, meaning that for any game G' within distance ε of G (each entry changed by at most
ε), each Nash equilibrium (p', q') in G' is Δ-close to some Nash equilibrium (p, q) in G, where closeness is
given by variation distance. Clearly, any game is (ε, 1) perturbation stable for any ε, and the smaller the Δ,
the more structure the (ε, Δ) perturbation stable games have. In this paper we study the meaningful range
of parameters, several structural properties, and the algorithmic behavior of these games.

Our first main result shows that for an interesting and general range of parameters the structure of
perturbation stable games can be leveraged to obtain better algorithms for equilibrium computation. Specifically, we
show that all n-action (ε, Δ) perturbation-stable games with at most $n^{O((\Delta/\epsilon)^2)}$ Nash equilibria must have a
well-supported ε-equilibrium of support $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$. This yields an
$n^{O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)}$-time
algorithm for finding an ε-equilibrium, improving by a factor $O(1/(\Delta^2 \log(1 + \Delta^{-1})))$ in the exponent over
the bound of [18] for games satisfying this condition (and reducing to the bound of [18] in the worst case
when Δ = 1).² Moreover, the stability condition can be further used to show that the approximate
equilibrium found will be Δ-close to a true equilibrium, and this holds even if the algorithm is given as input only
a perturbation of the true underlying stable game.

A particularly interesting class of games for which our results provide a dramatic improvement are those that
satisfy what we call t-uniform stability to perturbations. These are games that for some ε0 = 1/poly(n) and
some t satisfy the (ε, tε) stability to perturbations condition for all ε < ε0. For games satisfying t-uniform
stability to perturbations with t = poly(log(n)), our results imply that we can find 1/poly(n)-approximate
equilibria in $n^{\mathrm{poly}(\log n)}$ time, i.e., achieve a fully quasi-polynomial-time approximation scheme (FQPTAS).
This is especially interesting because the results of [?] prove that it is PPAD-hard to find
1/poly(n)-approximate equilibria in general games. Our results show that under the (widely believed) assumption that
PPAD is not contained in quasi-polynomial time [?], such uniformly stable games are inherently easier for



1 For example, if agents are two corporations with various possible actions in some proposed market, the precise payoffs to the
corporations may depend on specific quantities such as demand for electricity or the price of oil, which cannot be fully known in
advance but only estimated.

2 One should think of Δ as a function of ε, with both possibly depending on n. E.g., ε = 1/√n and Δ = 6ε.
computation of approximate equilibria than general bimatrix games.³ Moreover, variants of many games
appearing commonly in experimental economics, including the public goods game, matching pennies, and
identical interest game [14], satisfy this condition. See Sections 3 and 5 and Appendix C for detailed examples.

Our second main result shows that computing an ε-equilibrium in a game satisfying the $(\epsilon, \Theta(\epsilon^{1/4}))$
perturbation stability condition is as hard as computing a $\Theta(\epsilon^{1/4})$-equilibrium in a general game. For our
reduction, we show that any general game can be embedded into one having the $(\epsilon, \Theta(\epsilon^{1/4}))$ perturbation
stability property such that an ε-equilibrium in the new game yields a $\Theta(\epsilon^{1/4})$-equilibrium in the original
game. This result implies that the interesting range for the (ε, Δ)-perturbation stability condition, where one
could hope to do significantly better than in the general case, is $\Delta = o(\epsilon^{1/4})$.

We also connect our stability to perturbations condition to a seemingly very different stability to
approximations condition introduced by Awasthi et al. [2]. Formally, a game satisfies the strong (ε, Δ)-approximation
stability condition if all ε-approximate equilibria are contained inside a small ball of radius Δ around a
single equilibrium.⁴ We prove that our stability to perturbations condition is equivalent to a much weaker
version of this condition that we call the well-supported approximation stability. This condition requires
only that for any well-supported ε-approximate equilibrium⁵ (p, q) there exists a Nash equilibrium (p*, q*)
that is Δ-close to (p, q). Clearly, the well-supported approximation stability condition is more general than
the strong (ε, Δ)-approximation stability considered by [2]: rather than assuming that there exists a fixed
Nash equilibrium (p*, q*) such that all ε-approximate equilibria are contained in a ball of radius Δ around
(p*, q*), it requires only that for any well-supported ε-approximate equilibrium (p, q) there exists a Nash
equilibrium (p*, q*) that is Δ-close to (p, q). Thus, perturbation-stable games are significantly more
expressive than strongly approximation stable games, and Section 3 presents several examples of games satisfying
the former but not the latter. However, our lower bound (showing that the interesting range of parameters is
$\Delta = o(\epsilon^{1/4})$) also holds for the strong stability to approximations condition.

We also provide an interesting structural result showing that each Nash equilibrium of an (ε, Δ) perturbation
stable game with $n^{O((\Delta/\epsilon)^2)}$ Nash equilibria must be 8Δ-close to a well-supported ε-approximate equilibrium
of support only $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$. Similarly, a t-uniformly stable game with $n^{O(t^2)}$ equilibria has the
property that for any Δ, each equilibrium is 8Δ-close to a well-supported Δ/t-approximate equilibrium
with support of size $O(t^2 \log(1 + \Delta^{-1}) \log n)$. This property implies that in quasi-polynomial time we can
in fact find a set of approximate Nash equilibria that cover (within distance 8Δ) the set of all Nash equilibria
in such games.

It is interesting to note that for our algorithmic results for finding approximate equilibria we do not require
knowing the stability parameters. If the game happens to be reasonably stable, then we get improved running
times over the Lipton et al. [18] guarantees; if this is not the case, then we fall back to the Lipton et al. [18]
guarantees.⁶ However, given a game, it might be interesting to know how stable it is. In this direction, we
provide a characterization of stable constant-sum games in Section 6 and an algorithm for computing the



3 The generic result of [18] achieves quasi-polynomial time only for ε = Ω(1/poly(log n)).

4 [2] argue that this condition is interesting since in situations where one would want to use an approximate Nash equilibrium
for predicting how players will play (which is a common motivation for computing a Nash or an approximate Nash equilibrium),
without such a condition the approximate equilibrium found might be far from the equilibrium played.

5 Recall that in an ε-Nash equilibrium, the expected payoff of each player is within ε of her best-response payoff; however,
the mixed strategies may include poorly-performing pure strategies in their support. By contrast, the support of a well-supported
ε-approximate equilibrium may only contain strategies whose payoffs fall within ε of the player's best-response payoff.

6 This is because algorithmically, we can simply try different support sizes in increasing order and stop when we find strategies
forming a (well-supported) ε-equilibrium. In other words, given ε, the desired approximation level, we can find an ε-approximate
equilibrium in time $n^{O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)}$, where Δ is the smallest value such that the game is (ε, Δ) perturbation stable.
strong stability parameters of a given constant-sum game.
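The "try support sizes in increasing order" idea from footnote 6 can be illustrated with a small brute-force sketch. This is only an illustration under simplifying assumptions that are not from the paper: it searches k-uniform strategies (each probability a multiple of 1/k, in the spirit of the Lipton et al. sampling argument) and tests the plain ε-equilibrium condition rather than the well-supported one. The function names are hypothetical.

```python
from itertools import combinations_with_replacement


def k_uniform_strategies(n, k):
    """Yield all mixed strategies over n actions whose probabilities are multiples of 1/k."""
    for multiset in combinations_with_replacement(range(n), k):
        p = [0.0] * n
        for i in multiset:
            p[i] += 1.0 / k
        yield p


def is_eps_equilibrium(R, C, p, q, eps, tol=1e-12):
    """Check that no pure deviation gains either player more than eps."""
    n, m = len(R), len(R[0])
    row = [sum(R[i][j] * q[j] for j in range(m)) for i in range(n)]
    col = [sum(p[i] * C[i][j] for i in range(n)) for j in range(m)]
    v_row = sum(p[i] * row[i] for i in range(n))
    v_col = sum(col[j] * q[j] for j in range(m))
    return max(row) <= v_row + eps + tol and max(col) <= v_col + eps + tol


def find_small_support_equilibrium(R, C, eps, max_k):
    """Try k = 1, 2, ..., max_k and return the first (k, p, q) that is an eps-equilibrium."""
    n, m = len(R), len(R[0])
    for k in range(1, max_k + 1):
        for p in k_uniform_strategies(n, k):
            for q in k_uniform_strategies(m, k):
                if is_eps_equilibrium(R, C, p, q, eps):
                    return k, p, q
    return None


# Matching pennies: no pure profile is a 0.1-equilibrium, but the uniform mix is exact.
R = [[1.0, 0.0], [0.0, 1.0]]
C = [[0.0, 1.0], [1.0, 0.0]]
print(find_small_support_equilibrium(R, C, eps=0.1, max_k=3))
```

The search stops at the smallest k that works, so no stability parameter has to be supplied in advance; the cost grows roughly like n^(2k), which is why a smaller support bound translates directly into a faster search.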



1.1 Related Work 

In addition to results on computing (approximate) equilibria in worst-case instances of general bimatrix 
games, there has also been a series of results on polynomial-time algorithms for computing (approximate) 
equilibria in specific classes of bimatrix games. For example, Bárány et al. [?] considered two-player games
with randomly chosen payoff matrices, and showed that with high probability, such games have Nash
equilibria with small support. Their result implies that in random two-player games, Nash equilibria can be
computed in expected polynomial time. Kannan and Theobald [?] provide an FPTAS for the case where
the sum of the two payoff matrices has constant rank, and Adsul et al. [?] provide a polynomial-time
algorithm for computing an exact Nash equilibrium of a rank-1 bimatrix game.

Awasthi et al. [2] analyzed the question of finding an approximate Nash equilibrium in games that
satisfy stability with respect to approximation. However, their condition is quite restrictive in that it focuses
only on games that have the property that all the Nash equilibria are close together, thus eliminating from
consideration most common games. By contrast, our perturbation stability notion, which (as mentioned
above) can be shown to be a generalization of their notion, captures many more realistic situations. Our
upper bounds on approximate equilibria can be viewed as generalizing the corresponding result of [2], and the
proof is significantly more challenging technically. Moreover, our lower bounds also apply to the stability notion
of [2] and provide the first (nontrivial) results about the interesting range of parameters for that stability
notion as well.

In a very different context, for clustering problems, Bilu and Linial [?] analyze maxcut clustering instances
with the property that if the distances are perturbed by a multiplicative factor of α, then the optimum does not
change; they show that under this condition, for α = √n, one can find the optimum solution in polynomial
time. Recently, Awasthi et al. [?] have shown a similar result for the k-median clustering problem and
showed a similar result for a = y/S. Our stability to perturbations notion is inspired by this work, but is 
substantially less restrictive in two respects. First, we require stability only to small perturbations in the 
input, and second, we do not require the solutions (Nash equilibria) to stay fixed under perturbation, but 
rather just ask that they have a bounded degree of movement. 

The notion of stability to perturbations we consider in our paper is also related to the stability notions
considered by Lipton et al. [19] for economic solution concepts. The main focus of their work was understanding
whether for a given solution concept or optimization problem all instances are stable. In this paper, our main
focus is on understanding how rich the class of stable instances is, and what properties one can determine
about their structure that can be leveraged to get better algorithms for computing approximate Nash
equilibria.⁷ We provide the first results showing better algorithms for computing approximate equilibria in such
games.



2 Preliminaries 

We consider 2-player general-sum n-action bimatrix games. Let R denote the payoff matrix to the row 
player and C denote the payoff matrix of the column player. If the row player chooses strategy i and the 



7 Just as in [19], one can show that for the stability conditions we consider in our paper, there exist unstable instances.
column player chooses strategy j, the payoffs are $R_{ij}$ and $C_{ij}$ respectively. We assume all payoffs are
scaled to the range [0, 1].

A mixed strategy for a player is a probability distribution over the set of his pure strategies. The ith pure
strategy will be represented by the unit vector $e_i$, that has 1 in the ith coordinate and 0 elsewhere. For a
mixed strategy pair (p, q), the payoff to the row player is the expected value of a random variable which is
equal to $R_{ij}$ with probability $p_i q_j$. Therefore the payoff to the row player is $p^T R q$. Similarly the payoff to
the column player is $p^T C q$. Given strategies p and q for the row and column player, we denote by supp(p)
and supp(q) the support of p and q, respectively.

A Nash equilibrium [20] is a pair of strategies (p*, q*) such that no player has an incentive to deviate
unilaterally. Since mixed strategies are convex combinations of pure strategies, it suffices to consider only
deviations to pure strategies. In particular, a pair of mixed strategies (p*, q*) is a Nash equilibrium if for
every pure strategy i of the row player we have $e_i^T R q^* \le p^{*T} R q^*$, and for every pure strategy j of the
column player we have $p^{*T} C e_j \le p^{*T} C q^*$. Note that in a Nash equilibrium (p*, q*), all rows i in the support
of p* satisfy $e_i^T R q^* = p^{*T} R q^*$ and similarly all columns j in the support of q* satisfy $p^{*T} C e_j = p^{*T} C q^*$.



Definition 1 A pair of mixed strategies (p, q) is an ε-equilibrium if both players have no more than ε
incentive to deviate. Formally, (p, q) is an ε-equilibrium if for all rows i, we have $e_i^T R q \le p^T R q + \epsilon$, and for all
columns j, we have $p^T C e_j \le p^T C q + \epsilon$.
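As a concrete reading of Definition 1, the following minimal sketch computes each player's best pure-deviation gain and compares it with ε. Payoff matrices are plain nested lists; the helper names and the small floating-point tolerance `tol` are assumptions made for this illustration.

```python
def deviation_gains(R, C, p, q):
    """Return (row_gain, col_gain): the most each player can gain by a pure deviation."""
    n, m = len(R), len(R[0])
    row_payoffs = [sum(R[i][j] * q[j] for j in range(m)) for i in range(n)]  # e_i^T R q
    col_payoffs = [sum(p[i] * C[i][j] for i in range(n)) for j in range(m)]  # p^T C e_j
    v_row = sum(p[i] * row_payoffs[i] for i in range(n))                     # p^T R q
    v_col = sum(col_payoffs[j] * q[j] for j in range(m))                     # p^T C q
    return max(row_payoffs) - v_row, max(col_payoffs) - v_col


def is_eps_equilibrium(R, C, p, q, eps, tol=1e-12):
    """Definition 1: neither player gains more than eps by deviating."""
    row_gain, col_gain = deviation_gains(R, C, p, q)
    return row_gain <= eps + tol and col_gain <= eps + tol


# Matching pennies: the uniform profile is an exact equilibrium (eps = 0).
R = [[1.0, 0.0], [0.0, 1.0]]
C = [[0.0, 1.0], [1.0, 0.0]]
print(is_eps_equilibrium(R, C, [0.5, 0.5], [0.5, 0.5], eps=0.0))
```

Setting `eps=0` recovers the exact Nash condition stated just before the definition.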



Definition 2 A pair of mixed strategies (p, q) is a well supported ε-equilibrium if for any i ∈ supp(p) (i.e.,
i s.t. $p_i > 0$) we have $e_i^T R q \ge e_j^T R q - \epsilon$ for all j; similarly, for any i ∈ supp(q) (i.e., i s.t. $q_i > 0$) we
have $p^T C e_i \ge p^T C e_j - \epsilon$ for all j.
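Definition 2 constrains every pure strategy in the support, not just the average payoff. A minimal sketch of the check, with hypothetical helper names and a floating-point tolerance added as an assumption:

```python
def is_well_supported(R, C, p, q, eps, tol=1e-12):
    """Definition 2: every strategy played with positive probability is within eps of a best response."""
    n, m = len(R), len(R[0])
    row_payoffs = [sum(R[i][j] * q[j] for j in range(m)) for i in range(n)]  # e_i^T R q
    col_payoffs = [sum(p[i] * C[i][j] for i in range(n)) for j in range(m)]  # p^T C e_j
    best_row, best_col = max(row_payoffs), max(col_payoffs)
    rows_ok = all(row_payoffs[i] >= best_row - eps - tol for i in range(n) if p[i] > 0)
    cols_ok = all(col_payoffs[j] >= best_col - eps - tol for j in range(m) if q[j] > 0)
    return rows_ok and cols_ok


# Matching pennies with a slightly biased row player: the column player's two
# pure strategies earn 0.45 and 0.55, so the profile is well supported for
# eps = 0.2 but not for eps = 0.05.
R = [[1.0, 0.0], [0.0, 1.0]]
C = [[0.0, 1.0], [1.0, 0.0]]
p, q = [0.55, 0.45], [0.5, 0.5]
print(is_well_supported(R, C, p, q, eps=0.2), is_well_supported(R, C, p, q, eps=0.05))
```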

Definition 3 We say that a bimatrix game G' specified by R', C' is an α-perturbation of G specified by
R, C if we have $|R_{ij} - R'_{ij}| \le \alpha$ and $|C_{ij} - C'_{ij}| \le \alpha$ for all i, j ∈ {1, . . . , n}.
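Definition 3 is an entrywise (max-norm) condition on both payoff matrices. A minimal sketch of the test, with hypothetical function names and a small tolerance to absorb floating-point rounding:

```python
def max_entry_gap(A, B):
    """Largest absolute entrywise difference between two matrices of equal shape."""
    return max(abs(a - b) for row_a, row_b in zip(A, B) for a, b in zip(row_a, row_b))


def is_perturbation(R, C, R2, C2, alpha, tol=1e-12):
    """Definition 3: (R2, C2) is an alpha-perturbation of (R, C)."""
    return max_entry_gap(R, R2) <= alpha + tol and max_entry_gap(C, C2) <= alpha + tol


R = [[1.0, 0.0], [0.0, 1.0]]
C = [[0.0, 1.0], [1.0, 0.0]]
R2 = [[1.05, -0.05], [0.0, 1.0]]  # perturbed entries may leave [0, 1]
print(is_perturbation(R, C, R2, C, alpha=0.05))
```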

Definition 4 For two probability distributions q and q', we define the distance between q and q' as the
variation distance:

$$d(q, q') = \frac{1}{2}\sum_i |q_i - q'_i| = \sum_i \max(q_i - q'_i, 0) = \sum_i \max(q'_i - q_i, 0).$$

We define the distance between two strategy pairs as the maximum of the row-player's and column-player's
distances, that is: $d((p, q), (p', q')) = \max[d(p, p'), d(q, q')]$.

It is easy to see that d is a metric. If $d((p, q), (p', q')) \le \Delta$, then we say that (p', q') is Δ-close to (p, q).
Throughout this paper we use "log" to mean log-base-e.
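The three expressions for the variation distance in Definition 4 agree because both vectors sum to 1. A short sketch (function names are assumptions for this illustration) that computes the distance and the pair distance:

```python
def variation_distance(q, q2):
    """Half the L1 distance between two probability vectors."""
    return 0.5 * sum(abs(a - b) for a, b in zip(q, q2))


def positive_part_distance(q, q2):
    """Equivalent form: total mass by which q exceeds q2."""
    return sum(max(a - b, 0.0) for a, b in zip(q, q2))


def pair_distance(p, q, p2, q2):
    """Distance between strategy pairs: the larger of the two players' distances."""
    return max(variation_distance(p, p2), variation_distance(q, q2))


q, q2 = [0.5, 0.5, 0.0], [0.25, 0.25, 0.5]
print(variation_distance(q, q2), positive_part_distance(q, q2), positive_part_distance(q2, q))
```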



3 Stable Games 

The main notion of stability we introduce and study in this paper requires that any Nash equilibrium in a per- 
turbed game be close to a Nash equilibrium in the original game. This is an especially motivated condition 
since in many real world situations the entries of the game we analyze are merely based on measurements 
and thus only approximately reflect the players' payoffs. In order to be useful for prediction, we would like
that equilibria in the game we operate with be close to equilibria in the real game played by the players. 
Otherwise, in games where certain equilibria of slightly perturbed games are far from all equilibria in the 
original game, the analysis of behavior (or prediction) will be meaningless. Formally: 

Definition 5 A game G satisfies the (ε, Δ) stability to perturbations condition if for any G' that is an
ε-perturbation of G and for any Nash equilibrium (p, q) in G', there exists a Nash equilibrium (p*, q*) in G
such that (p, q) is Δ-close to (p*, q*).⁸

Observe that fixing ε, a smaller Δ means a stronger condition and a larger Δ means a weaker condition.
Every game is (ε, 1)-perturbation stable, and as Δ gets smaller, we might expect the game to exhibit
more useful structure.

Another stability condition we consider in this work is stability to approximations: 

Definition 6 A game satisfies the (ε, Δ)-approximation stability condition if for any ε-equilibrium (p, q)
there exists a Nash equilibrium (p*, q*) such that (p, q) is Δ-close to (p*, q*).

A game satisfies the well supported (ε, Δ)-approximation stability condition if for any well supported
ε-equilibrium (p, q) there exists a Nash equilibrium (p*, q*) such that (p, q) is Δ-close to (p*, q*).

Clearly, if a game satisfies the (ε, Δ)-approximation stability condition, then it also satisfies the well
supported (ε, Δ)-approximation stability condition. Interestingly, we show that the stability to perturbations
condition is equivalent to the well supported approximation stability condition. Specifically:

Theorem 1 A game satisfies the well supported (2ε, Δ)-approximation stability condition if and only if it
satisfies the (ε, Δ)-stability to perturbations condition.

Proof: Consider an n × n bimatrix game specified by R and C and assume it satisfies the well supported
(2ε, Δ)-approximation stability condition; we show it also satisfies the (ε, Δ)-stability to perturbations
condition. Consider R' = R + Γ and C' = C + Λ, where $|\Gamma_{ij}| \le \epsilon$ and $|\Lambda_{ij}| \le \epsilon$ for all i, j. Let (p, q)
be an arbitrary Nash equilibrium in the new game specified by R' and C'. We will show that (p, q) is a
well supported 2ε-approximate Nash equilibrium in the original game specified by R and C. To see this,
note that by definition (since (p, q) is a Nash equilibrium in the game specified by R' and C') we have
$e_j^T R' q \le p^T R' q = v_R$ for all j; therefore $e_j^T R q + e_j^T \Gamma q \le v_R$, so $e_j^T R q \le v_R + \epsilon$ for all j. On the other
hand we also have $e_i^T R q = e_i^T R' q - e_i^T \Gamma q \ge v_R - \epsilon$ for all i ∈ supp(p). Therefore, $e_i^T R q \ge e_j^T R q - 2\epsilon$
for all i ∈ supp(p) and for all j. Similarly we can show $p^T C e_i \ge p^T C e_j - 2\epsilon$ for all i ∈ supp(q) and
for all j. This implies that (p, q) is a well supported 2ε-approximate Nash equilibrium in the original game, and so by
assumption is Δ-close to a Nash equilibrium of the game specified by R and C. So, this game satisfies the
(ε, Δ)-stability to perturbations condition.

In the reverse direction, consider an n × n bimatrix game specified by R and C and assume it satisfies the
(ε, Δ)-stability to perturbations condition. Let (p, q) be an arbitrary well supported 2ε Nash equilibrium

8 Note that the entries of the perturbed game are not restricted to the [0, 1] interval, and are allowed to belong to [−ε, 1 + ε].
This is a proper way to formulate the notion because it implies, for instance, that if G is (ε, Δ) stable to perturbations, then for any
α > 0, αG is (αε, Δ) stable to perturbations. Theorem 1 provides further evidence that this definition is proper.
in this game. Let us define matrices R' and C' such that $e_i^T R' q = \max_{i'} e_{i'}^T R q - \epsilon$ for all i ∈ supp(p)
and $e_i^T R' q \le \max_{i'} e_{i'}^T R q - \epsilon$ for all i ∉ supp(p), $p^T C' e_j = \max_{j'} p^T C e_{j'} - \epsilon$ for all j ∈ supp(q) and
$p^T C' e_j \le \max_{j'} p^T C e_{j'} - \epsilon$ for all j ∉ supp(q). Since (p, q) is a well supported 2ε Nash equilibrium we
know this can be done such that $|(R' - R)_{ij}| \le \epsilon$ and $|(C' - C)_{ij}| \le \epsilon$ for all i, j (in particular, we have
to add quantities in [−ε, ε] to all the elements in rows i of R in the support of p and subtract quantities in
[0, ε] from all the elements in rows i of R not in the support of p; similarly for q). By design, (p, q) is a
Nash equilibrium in the game defined by R', C', and from the (ε, Δ)-stability to perturbations condition,
we obtain that it is Δ-close to a true Nash equilibrium of the game specified by R and C. Thus, any well
supported 2ε Nash equilibrium in the game specified by R and C is Δ-close to a true Nash equilibrium of
this game, as desired. ■
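The construction in the reverse direction can be made concrete. The sketch below is one possible instantiation, not necessarily the exact one intended in the proof: it shifts each whole row of R (and each whole column of C) by a single constant, which changes the corresponding expected payoff by exactly that constant. Function names are hypothetical, and the input (p, q) is assumed to be a well supported 2ε-equilibrium of (R, C).

```python
def perturb_to_nash(R, C, p, q, eps):
    """Return (R2, C2), each within eps of R, C entrywise, making (p, q) an exact Nash equilibrium.

    Assumes (p, q) is a well supported 2*eps-equilibrium of (R, C).
    """
    n, m = len(R), len(R[0])
    u = [sum(R[i][j] * q[j] for j in range(m)) for i in range(n)]  # e_i^T R q
    w = [sum(p[i] * C[i][j] for i in range(n)) for j in range(m)]  # p^T C e_j
    target_r, target_c = max(u) - eps, max(w) - eps
    # Support rows are moved exactly onto the target (shift in [-eps, eps]);
    # other rows are only lowered when they exceed it (shift in [-eps, 0]).
    row_shift = [target_r - u[i] if p[i] > 0 else min(0.0, target_r - u[i]) for i in range(n)]
    col_shift = [target_c - w[j] if q[j] > 0 else min(0.0, target_c - w[j]) for j in range(m)]
    R2 = [[R[i][j] + row_shift[i] for j in range(m)] for i in range(n)]
    C2 = [[C[i][j] + col_shift[j] for j in range(m)] for i in range(n)]
    return R2, C2


def is_nash(R, C, p, q, tol=1e-9):
    """Exact Nash check up to a floating-point tolerance."""
    n, m = len(R), len(R[0])
    u = [sum(R[i][j] * q[j] for j in range(m)) for i in range(n)]
    w = [sum(p[i] * C[i][j] for i in range(n)) for j in range(m)]
    v_row = sum(p[i] * u[i] for i in range(n))
    v_col = sum(w[j] * q[j] for j in range(m))
    return max(u) <= v_row + tol and max(w) <= v_col + tol


# Matching pennies with a biased row player: a well supported 0.1-equilibrium, so eps = 0.05.
R = [[1.0, 0.0], [0.0, 1.0]]
C = [[0.0, 1.0], [1.0, 0.0]]
p, q = [0.55, 0.45], [0.5, 0.5]
R2, C2 = perturb_to_nash(R, C, p, q, eps=0.05)
print(is_nash(R, C, p, q), is_nash(R2, C2, p, q))
```

In this example (p, q) is not a Nash equilibrium of the original game but becomes one after a perturbation of size at most 0.05, which is the situation the (ε, Δ)-stability condition is then applied to.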

One can show that the well supported approximation stability is a strict relaxation of the approximation 
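stability condition. For example, consider the 2 × 2 bimatrix game

$$R = \begin{pmatrix} 1 & 1 \\ 1 - \epsilon_0 & 1 - \epsilon_0 \end{pmatrix}, \qquad C = \begin{pmatrix} 1 & 1 - \epsilon_0 \\ 1 & 1 - \epsilon_0 \end{pmatrix}.$$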

For ε < ε0 this game satisfies the well supported (ε, 0)-approximation stability condition, but does not
satisfy (ε, Δ)-approximation stability for any Δ < ε/ε0. To see this note that $e_1^T R q = 1$, $e_2^T R q = 1 - \epsilon_0$,
$p^T C e_1 = 1$, and $p^T C e_2 = 1 - \epsilon_0$ for any p and q. This implies that the only well supported ε-Nash
equilibrium is identical to the Nash equilibrium $((1, 0)^T, (1, 0)^T)$, thus the game is well supported (ε, 0)-approximation
stable. On the other hand, the pair of mixed strategies (p, q) with $p = (1 - \epsilon/\epsilon_0, \epsilon/\epsilon_0)^T$ and
$q = (1 - \epsilon/\epsilon_0, \epsilon/\epsilon_0)^T$ is an ε-Nash equilibrium. The distance between (p, q) and the unique Nash equilibrium is ε/ε0,
thus this game is not (ε, Δ)-approximation stable for any Δ < ε/ε0.
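The claims about this 2 × 2 example are easy to verify numerically. The sketch below uses the illustrative values ε0 = 0.5 and ε = 0.25 (chosen here only because they are exactly representable in floating point); helper names are assumptions.

```python
def example_game(eps0):
    """The 2x2 game from the text: the second row and second column each cost eps0."""
    R = [[1.0, 1.0], [1.0 - eps0, 1.0 - eps0]]
    C = [[1.0, 1.0 - eps0], [1.0, 1.0 - eps0]]
    return R, C


def payoffs(R, C, p, q):
    """Per-strategy payoffs e_i^T R q and p^T C e_j."""
    row = [sum(R[i][j] * q[j] for j in range(2)) for i in range(2)]
    col = [sum(p[i] * C[i][j] for i in range(2)) for j in range(2)]
    return row, col


eps0, eps = 0.5, 0.25
R, C = example_game(eps0)
p = q = [1.0 - eps / eps0, eps / eps0]
row, col = payoffs(R, C, p, q)

# Gain from the best pure deviation for each player: exactly eps.
row_gain = max(row) - sum(p[i] * row[i] for i in range(2))
col_gain = max(col) - sum(col[j] * q[j] for j in range(2))

# The second row is in the support but earns eps0 > eps less than the best
# response, so (p, q) is an eps-equilibrium that is not well supported.
worst_supported_gap = max(row) - min(row[i] for i in range(2) if p[i] > 0)

# Variation distance from p to the unique Nash strategy (1, 0): eps / eps0.
dist = 0.5 * (abs(p[0] - 1.0) + abs(p[1] - 0.0))
print(row_gain, col_gain, worst_supported_gap, dist)
```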

Interestingly, the notion of approximation stability, which is a restriction of the well-supported approximation
stability and of stability to perturbations conditions, is a relaxation of the stability condition considered by
Awasthi et al. [2], which requires that all approximate equilibria be contained in a ball of radius Δ around
a single Nash equilibrium. In this direction, we define the strong version of the stability conditions given in
Definitions 5 and 6 to be a reversal of quantifiers that asks that there be a single (p*, q*) such that each relevant
(p, q) (equilibrium in an ε-perturbed game, ε-approximate equilibrium, or well-supported ε-approximate
equilibrium) is Δ-close to (p*, q*). Formally:

Definition 7 A game G satisfies the strong (ε, Δ) stability to perturbations condition if there exists (p*, q*),
a Nash equilibrium of G, such that for any G' that is an $L_\infty$ ε-perturbation of G we have that any Nash
equilibrium in G' is Δ-close to (p*, q*).

A game G satisfies the strong (well supported) (ε, Δ)-approximation stability condition if there exists
(p*, q*), a Nash equilibrium of G, such that any (well supported) ε-equilibrium (p, q) is Δ-close to (p*, q*).

It is immediate from its proof that Theorem 1 applies to the strong versions of the definitions as well. We
also note that our generic upper bounds in Section 4 will apply to the most relaxed version
(perturbation-stability) and our generic lower bound in Section 5 will apply to the most stringent version (strong
approximation stability).

Range of parameters As shown in [2], if a game G satisfies the strong (ε, Δ)-approximation stability
condition and has a non-trivial Nash equilibrium (an equilibrium in which the players do not both have
full support), then we must have Δ ≥ ε. We can show that if a game G satisfies the (ε, Δ)-approximation
stability condition and if the union of all Δ-balls around all Nash equilibria does not cover the whole space,⁹ then we
must have 3Δ ≥ ε; see Lemma 5 in Appendix B. In Section 5 we further discuss the meaningful range of
parameters from the point of view of the equilibrium computation problem.



Examples Variants of many classic games including the public goods game, matching pennies, and
identical interest games are stable. As an example, consider the following modified identical interest game. Both
players have n available actions. The first action is to stay home, and the other actions correspond to n − 1
different possible meeting locations. If a player chooses action 1 (stay home), his payoff is 1/2 no matter
what the other player is doing. If the player chooses to go out to a meeting location, his payoff is 1 if the
other player is there as well and it is 0 otherwise. This game has n pure equilibria (all $(e_i, e_i)$) and $\binom{n}{2}$
equilibria (all $(\frac{1}{2}e_i + \frac{1}{2}e_j, \frac{1}{2}e_i + \frac{1}{2}e_j)$) and it is well-supported (ε, 2ε)-approximation stable for all
ε < 1/6. Note that it does not satisfy strong (well-supported) stability because it has multiple very distinct
equilibria. For further examples see Lemma 2 in Section 5 as well as Appendix C.
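The equilibrium structure of the modified identical interest game can be checked directly for a small n. In this sketch action index 0 plays the role of "stay home" (action 1 in the text); the helper names are assumptions, and the code verifies only the listed equilibria, not the stability parameters.

```python
from itertools import combinations


def meeting_game(n):
    """Payoff matrices: staying home (index 0) pays 1/2; meeting at the same location pays 1."""
    R = [[0.5 if i == 0 else (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    C = [[0.5 if j == 0 else (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    return R, C


def is_nash(R, C, p, q, tol=1e-12):
    """No pure deviation improves either player's expected payoff."""
    n = len(R)
    row = [sum(R[i][j] * q[j] for j in range(n)) for i in range(n)]
    col = [sum(p[i] * C[i][j] for i in range(n)) for j in range(n)]
    v_row = sum(p[i] * row[i] for i in range(n))
    v_col = sum(col[j] * q[j] for j in range(n))
    return max(row) <= v_row + tol and max(col) <= v_col + tol


def unit(n, i):
    return [1.0 if k == i else 0.0 for k in range(n)]


def half_mix(n, i, j):
    return [0.5 if k in (i, j) else 0.0 for k in range(n)]


n = 4
R, C = meeting_game(n)
pure = [unit(n, i) for i in range(n)]
mixed = [half_mix(n, i, j) for i, j in combinations(range(n), 2)]
print(all(is_nash(R, C, s, s) for s in pure), all(is_nash(R, C, s, s) for s in mixed), len(mixed))
```

For n = 4 this confirms the 4 pure equilibria and the 6 = C(4, 2) half-half mixed equilibria; a uniform mix over three locations is not an equilibrium because staying home then pays more.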



4 Equilibria in Stable Games 

In this section we show we can leverage the structure implied by stability to perturbations to improve over 
the best known generic bound of [18]. We start by considering ε and Δ as given. We can show:



Theorem 2 Let us fix ε and Δ, 0 ≤ ε ≤ Δ ≤ 1. Consider a game with at most $n^{O((\Delta/\epsilon)^2)}$ Nash
equilibria which satisfies the well supported (ε, Δ)-approximation stability condition (or the (ε/2, Δ)-stability to
perturbations condition). Then there exists a well supported ε-equilibrium where each player's strategy has
support $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$.



This improves by a factor $O(1/(\Delta^2 \log(1 + \Delta^{-1})))$ in the exponent over the bound of [18] for games
satisfying these conditions (and reduces to the bound of [18] in the worst case when Δ = 1).
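To get a feel for the size of the improvement, the two support bounds can be compared numerically. The sketch below ignores the constants hidden in the O-notation (an assumption made purely for illustration), so only the ratio between the two quantities is meaningful. Function names are hypothetical.

```python
import math


def general_support(n, eps):
    """Support bound of the general result, up to constants: (log n) / eps^2."""
    return math.log(n) / eps ** 2


def stable_support(n, eps, delta):
    """Support bound of Theorem 2, up to constants: (delta/eps)^2 * log(1 + 1/delta) * log n."""
    return (delta / eps) ** 2 * math.log(1.0 + 1.0 / delta) * math.log(n)


def improvement_factor(delta):
    """Ratio of the two bounds: 1 / (delta^2 * log(1 + 1/delta))."""
    return 1.0 / (delta ** 2 * math.log(1.0 + 1.0 / delta))


n, eps = 10 ** 6, 0.01
for delta in (1.0, 0.1, 0.05):
    ratio = general_support(n, eps) / stable_support(n, eps, delta)
    print(delta, round(ratio, 2), round(improvement_factor(delta), 2))
```

At Δ = 1 the ratio is the constant 1/log 2, matching the remark that the bound reduces to the general one in the worst case; as Δ shrinks toward ε the ratio grows roughly like 1/Δ².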



Proof idea We start by showing that any Nash equilibrium (p*, q*) of G must be highly concentrated. In
particular, we show that for each of p*, q*, any portion of the distribution with substantial $L_1$ norm (having
total probability at least 8Δ) must also have high $L_2$ norm: specifically, the ratio of $L_2$ norm to $L_1$ norm
must be $\Omega((\epsilon/\Delta)(\log n)^{-1/2})$. This in turn can be used to show that each of p*, q* has all but at most 8Δ
of its total probability mass concentrated in a set (which we call the high part) of size
$O((\Delta/\epsilon)^2 \log(1 + 1/\Delta) \log n)$. Once the desired concentration is proven, we can then perform a version of the [18] sampling
procedure on the low parts of p* and q* (with an accuracy of only ε/Δ) to produce overall an ε-approximate
equilibrium of support only a constant factor larger. The primary challenge in this argument is to prove
that p* and q* are concentrated.¹⁰ This is done through our key lemma, Lemma 1 below. In particular,
Lemma 1 can be used to show that if p* (or q*) had a portion with substantial $L_1$ norm and low $L_2$ norm,
then there must exist a deviation from p* (or q*) that is far from all equilibria and yet is a well-supported
9 If the union of all Δ-balls around all Nash equilibria does cover the whole space, this is an easy case from our perspective.
Any (p, q) would be an ε-equilibrium.

10 We note that [2] prove an upper bound for the strong approximation stability condition using the same concentration idea.
However, proving the desired concentration is significantly more challenging in our case since we deal with many equilibria.
approximate-Nash equilibrium, violating the stability condition. Proving the existence of such a deviation
is challenging because of the large number of equilibria that may exist. Lemma 1 synthesizes the key points
of this argument and it is proven through a careful probabilistic argument.

In the following we consider $c = (56)^2$ and let $c' = 27$.

Lemma 1 Let us fix ε and Δ, 0 ≤ ε ≤ Δ ≤ 1. Let p be an arbitrary distribution over {1, 2, . . . , n}. Let
$S = c(\Delta/\epsilon)^2 \log n$ and fix β < 1 such that 1 − β ≥ 8Δ. Assume that the entries of p can be partitioned into
two sets H and L such that $\|p_L\|_1 = 1 - \beta$ and $\|p_L\|_2^2 \le (1 - \beta)^2 / S$. Let us fix $k_1$ n-dimensional vectors
$v_1, \ldots, v_{k_1}$ with entries in [−1, 1] and $k_2$ distributions $p_1, \ldots, p_{k_2}$, where $k_1 = n^2$ and $k_2 \le n^{c'(\Delta/\epsilon)^2}$. Then
there exists p' with supp(p') ⊆ supp(p) such that

1. $d(p, p') = 3\Delta$ and

2. $p' \cdot v_i \le p \cdot v_i + \epsilon$ for all $i \in \{1, \ldots, k_1\}$.

3. $d(p', p_i) \ge d(p, p_i) - \Delta$ for all $i \in \{1, \ldots, k_2\}$.

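Proof: We show the desired result by using the probabilistic method. For each i ∈ L, let us define the random variable
$X_i = 1$ with probability 1/2 and $X_i = 0$ with probability 1/2, independently. Define

$p'_i = p_i$ for $i \in H$, and

$$p'_i = p_i + \frac{3\Delta\, p_i X_i}{\sum_{j \in L} p_j X_j} - \frac{3\Delta\, p_i (1 - X_i)}{\sum_{j \in L} p_j (1 - X_j)} \quad \text{for } i \in L.$$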

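We have $E[\sum_{i \in L} p_i X_i] = (1 - \beta)/2$. By applying McDiarmid's inequality (see Theorem 6) and using the
fact that $\|p_L\|_2^2 \le (1 - \beta)^2 / S$, we obtain that with probability at least 3/4 we have both:

$$\sum_{i \in L} p_i X_i \ge 3\Delta \quad \text{and} \quad \sum_{i \in L} p_i (1 - X_i) \ge 3\Delta.$$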

Assume that this happens. In this case, p' is a legal mixed strategy for the row player and by construction 
we have d(p,p') = 3 A. 

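Let $v$ be an arbitrary vector in $\{v_1, \ldots, v_{k_1}\}$. We have:

$$p' \cdot v = p \cdot v + 3\Delta \left[ \frac{\sum_{i \in L} p_i X_i v_i}{\sum_{i \in L} p_i X_i} - \frac{\sum_{i \in L} p_i (1 - X_i) v_i}{\sum_{i \in L} p_i (1 - X_i)} \right].$$

Define

$$Z_1 = \sum_{i \in L} p_i X_i v_i, \quad Z_2 = \sum_{i \in L} p_i X_i, \quad Z_3 = \sum_{i \in L} p_i (1 - X_i) v_i, \quad Z_4 = \sum_{i \in L} p_i (1 - X_i),$$

so we have:

$$p' \cdot v = p \cdot v + 3\Delta \left[ \frac{Z_1}{Z_2} - \frac{Z_3}{Z_4} \right].$$

Using McDiarmid's inequality we get that with probability at least $1 - 1/n^3$, each of the quantities $Z_1, Z_2, Z_3, Z_4$ is within $\left(\frac{1-\beta}{28}\right)\left(\frac{\epsilon}{\Delta}\right)$ of its expectation; we are using here the fact that the value of $X_i$ can change any one of the quantities by at most $p_i$, so the exponent in the McDiarmid bound is

$$-2\left(\frac{1-\beta}{28}\right)^2 \left(\frac{\epsilon}{\Delta}\right)^2 \Big/ \sum_{i \in L} p_i^2 \;\le\; -2\left(\frac{c}{28^2}\right) \log n.$$

Also, we have $E[Z_2] = E[Z_4] = \frac{1-\beta}{2}$, and $E[Z_1] = E[Z_3]$. So, we get that with probability at least $1 - 1/n^3$ we have

$$\begin{aligned}
p' \cdot v &\le p \cdot v + 3\Delta \left[ \frac{E[Z_1] + \frac{(1-\beta)\epsilon}{28\Delta}}{\frac{1-\beta}{2} - \frac{(1-\beta)\epsilon}{28\Delta}} - \frac{E[Z_3] - \frac{(1-\beta)\epsilon}{28\Delta}}{\frac{1-\beta}{2} + \frac{(1-\beta)\epsilon}{28\Delta}} \right] \\
&= p \cdot v + 3\Delta \, \frac{2}{1-\beta} \left[ \frac{E[Z_1] + \frac{(1-\beta)\epsilon}{28\Delta}}{1 - \frac{\epsilon}{14\Delta}} - \frac{E[Z_3] - \frac{(1-\beta)\epsilon}{28\Delta}}{1 + \frac{\epsilon}{14\Delta}} \right] \\
&= p \cdot v + 3\Delta \, \frac{2}{1-\beta} \cdot \frac{1}{1 - \left(\frac{\epsilon}{14\Delta}\right)^2} \left[ \left(E[Z_1] + E[Z_3]\right) \frac{\epsilon}{14\Delta} + \frac{(1-\beta)\epsilon}{14\Delta} \right].
\end{aligned}$$

Finally, using the fact that $Z_1 + Z_3 \le \sum_{i \in L} p_i = 1 - \beta$ we get

$$p' \cdot v \le p \cdot v + 3.1\Delta \left( \frac{\epsilon}{7\Delta} + \frac{\epsilon}{7\Delta} \right),$$

yielding the desired bound $p' \cdot v \le p \cdot v + \epsilon$. Applying the union bound over all $i \in \{1, \ldots, k_1\}$ we obtain that the probability that there exists $v$ in $v_1, \ldots, v_{k_1}$ such that $p' \cdot v > p \cdot v + \epsilon$ is at most 1/3.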

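Consider an arbitrary distribution $\tilde{p}$ in $\{p_1, \ldots, p_{k_2}\}$. Assume that $\tilde{p} = p + g$. By definition, we have:

$$\begin{aligned}
d(\tilde{p}, p') &= \frac{1}{2} \sum_{i \in H} |g_i| + \frac{1}{2} \sum_{i \in L} \left| p_i + g_i - p_i - \frac{3\Delta p_i X_i}{\sum_{j \in L} p_j X_j} + \frac{3\Delta p_i (1 - X_i)}{\sum_{j \in L} p_j (1 - X_j)} \right| \\
&\ge \frac{1}{2} \sum_{i \in H} |g_i| + \frac{1}{2} \sum_{i \in L} \left| g_i - \frac{6\Delta p_i X_i}{1-\beta} + \frac{6\Delta p_i (1 - X_i)}{1-\beta} \right| - \frac{1}{2} \sum_{i \in L} \left( \frac{6\Delta p_i X_i}{5(1-\beta)} + \frac{6\Delta p_i (1 - X_i)}{5(1-\beta)} \right) \\
&= \frac{1}{2} \sum_{i \in H} |g_i| + \frac{1}{2} \sum_{i \in L} \left| g_i - \frac{6\Delta p_i X_i}{1-\beta} + \frac{6\Delta p_i (1 - X_i)}{1-\beta} \right| - \frac{1}{2} \sum_{i \in L} \frac{6\Delta p_i}{5(1-\beta)} \\
&= \frac{1}{2} \sum_{i \in H} |g_i| + \frac{1}{2} \sum_{i \in L} \left| g_i - \frac{6\Delta p_i X_i}{1-\beta} + \frac{6\Delta p_i (1 - X_i)}{1-\beta} \right| - \frac{3\Delta}{5},
\end{aligned}$$

where the first inequality follows from applying relation (1) to the denominators (together with the triangle inequality), and the last equality follows from the fact that $\|p_L\|_1 = 1 - \beta$.

Let us denote by $Z = \frac{1}{2} \sum_{i \in L} \left| g_i - \frac{6\Delta p_i X_i}{1-\beta} + \frac{6\Delta p_i (1 - X_i)}{1-\beta} \right|$. We have:

$$E[Z] = \frac{1}{2} \sum_{i \in L} \left( \frac{1}{2} \left| g_i - \frac{6\Delta p_i}{1-\beta} \right| + \frac{1}{2} \left| g_i + \frac{6\Delta p_i}{1-\beta} \right| \right) \ge \frac{1}{2} \sum_{i \in L} |g_i|,$$

therefore

$$E[d(\tilde{p}, p')] \ge E[Z] + \frac{1}{2} \sum_{i \in H} |g_i| - \frac{3\Delta}{5} \ge \frac{1}{2} \sum_{i \in L} |g_i| + \frac{1}{2} \sum_{i \in H} |g_i| - \frac{3\Delta}{5} = d(p, \tilde{p}) - \frac{3\Delta}{5}.$$

We can now apply McDiarmid's inequality (see Theorem 6) to argue that with high probability $Z$ is within $2\Delta/5$ of its expectation. Note that $c_i = \frac{6\Delta p_i}{1-\beta}$. Therefore:

$$\Pr\{|Z - E[Z]| \ge 2\Delta/5\} \le 2e^{-2\Delta^2 (1-\beta)^2 / \left(225 \sum_{i \in L} p_i^2 \Delta^2\right)} \le 2e^{-(2/225) S} \le \frac{1}{3k_2}.$$

This then implies that

$$\Pr\{d(\tilde{p}, p') < d(\tilde{p}, p) - \Delta\} \le \frac{1}{3k_2}.$$

By the union bound we get that the probability that there exists a $\tilde{p}$ in $\{p_1, \ldots, p_{k_2}\}$ such that $d(\tilde{p}, p') < d(\tilde{p}, p) - \Delta$ is at most 1/3. Summing up over all possible events we get that there is a non-zero probability of (1), (2), (3) happening, as desired. ■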
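The random construction of $p'$ in the proof can be sketched in a few lines. This is only an illustrative sketch: the function names `perturb_light_part` and `variation_distance` are hypothetical, the light set is passed in explicitly, and the code simply asks the caller to resample when the coin flips are too unbalanced for $p'$ to be a distribution.

```python
import random

def perturb_light_part(p, light, delta, rng):
    """Build p' as in the proof: move 3*delta of mass inside the light set L."""
    x = {i: rng.randint(0, 1) for i in light}        # fair coins X_i
    up = sum(p[i] * x[i] for i in light)             # sum_{i in L} p_i X_i
    down = sum(p[i] * (1 - x[i]) for i in light)     # sum_{i in L} p_i (1 - X_i)
    if up == 0 or down < 3 * delta:
        # Too unbalanced: p' would not be a valid distribution.
        raise ValueError("unbalanced coin flips, resample")
    q = list(p)
    for i in light:
        if x[i]:
            q[i] = p[i] + 3 * delta * p[i] / up      # entries with X_i = 1 gain mass
        else:
            q[i] = p[i] - 3 * delta * p[i] / down    # entries with X_i = 0 lose mass
    return q

def variation_distance(p, q):
    """d(p, q) = (1/2) * sum_i |p_i - q_i|."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))
```

Whatever the coin flips are, the mass added equals the mass removed, which is why $d(p, p') = 3\Delta$ holds by construction; properties (2) and (3) are the ones that need the concentration argument.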



Proof (Theorem 2): Let $(p^*, q^*)$ be an arbitrary Nash equilibrium. We show that each of $p^*$ and $q^*$ is highly concentrated, meaning that all but at most $8\Delta$ of its total probability mass is concentrated in a set of size $O((\Delta/\epsilon)^2 \log(1 + 1/\Delta) \log n)$. Let us consider $p^*$ (the argument for $q^*$ is similar). We begin by partitioning it into its heavy and light parts. Specifically, we greedily remove the largest entries of $p^*$ and place them into a set $H$ (the heavy elements) until either:

(a) the remaining entries $L$ (the light elements) satisfy the condition that $\forall i \in L$, $\Pr[i] \le \frac{1}{S} \Pr[L]$ for $S$ as in Lemma 1, or

(b) $\Pr[H] \ge 1 - 8\Delta$,

whichever comes first. Using the fact that the game satisfies the well supported $(\epsilon, \Delta)$-approximation stability condition, we will show that case (a) cannot occur first, which will imply that $p^*$ is highly concentrated.
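The greedy heavy/light partition just described can be sketched as follows. This is a minimal illustration; the function name `greedy_heavy_light` and its return convention (the sets as index lists plus a label for the stopping rule) are assumptions made here, not part of the paper.

```python
def greedy_heavy_light(p, S, delta):
    """Move the largest entries of p into H until rule (a) or rule (b) fires.

    Returns (H, L, rule) where rule is "a" or "b".
    """
    order = sorted(range(len(p)), key=lambda i: -p[i])   # largest entries first
    heavy_mass = 0.0
    for k in range(len(order) + 1):
        H, L = order[:k], order[k:]
        light_mass = sum(p[i] for i in L)
        # (a): every light entry is at most a 1/S fraction of the light mass
        # (checking the largest light entry, L[0], is enough).
        if L and p[L[0]] <= light_mass / S:
            return H, L, "a"
        # (b): the heavy set already carries at least 1 - 8*delta of the mass.
        if heavy_mass >= 1 - 8 * delta:
            return H, L, "b"
        if k < len(order):
            heavy_mass += p[order[k]]
    return order, [], "b"
```

For example, with two large entries and a flat tail the procedure stops by rule (a) after removing the two large entries; the proof shows that at an actual Nash equilibrium of a stable game only rule (b) can fire first.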

In the following, we denote by $\beta$ the total probability mass over $H$. Assume by contradiction that case (a) occurs first. Note that we have $\|p^*_L\|_1 = 1 - \beta$, $\|p^*_H\|_1 = \beta$, and

$$\sum_{i \in L} (p^*_i)^2 \le \max_{i \in L} p^*_i \sum_{i \in L} p^*_i \le \frac{1-\beta}{S}(1-\beta),$$

so $\|p^*_L\|_2^2 \le \frac{(1-\beta)^2}{S}$. Let $v_{i,j} = C(e_i - e_j)$. Since $(p^*, q^*)$ is a Nash equilibrium we know that $p^* \cdot v_{i,j} \le 0$ for all $i$ and for all $j \in \mathrm{supp}(q^*)$.

By Lemma 1 there exists $p'$ such that (1) $d(p^*, p') = 3\Delta$, (2) $p' \cdot v_{i,j} \le p^* \cdot v_{i,j} + \epsilon$ for all $i$ and for all $j \in \mathrm{supp}(q^*)$, and (3) $d(p', p_i) \ge d(p^*, p_i) - \Delta$ for all $i \in \{1, \ldots, k\}$ (here $k$ is the number of equilibria of the game). By (2) we have that $p' \cdot v_{i,j} \le \epsilon$ for all $i$ and for all $j \in \mathrm{supp}(q^*)$, which implies that $(p', q^*)$ is a well supported $\epsilon$-approximate equilibrium (since by (2) the column player has at most an $\epsilon$ incentive to deviate, and since $\mathrm{supp}(p') \subseteq \mathrm{supp}(p^*)$ we know that the row player has no incentive to deviate). By (1) we also have that $(p', q^*)$ is $3\Delta$-far from $(p^*, q^*)$. We now use (3) to show that $(p', q^*)$ is $\Delta$-far from all the other equilibria as well. Let $\tilde{p}$ be the row strategy of such an equilibrium. If $d(\tilde{p}, p^*) > 4\Delta$, then by the triangle inequality $d(\tilde{p}, p') \ge d(\tilde{p}, p^*) - d(p^*, p') \ge \Delta$. If $d(\tilde{p}, p^*) < 2\Delta$, then by the triangle inequality $d(\tilde{p}, p') \ge d(p^*, p') - d(\tilde{p}, p^*) \ge \Delta$. Finally, if $d(\tilde{p}, p^*) \in [2\Delta, 4\Delta]$, then by (3) we have that $d(\tilde{p}, p') \ge \Delta$, as desired.

Overall we get that $(p', q^*)$ is a well supported $\epsilon$-approximate equilibrium that is $\Delta$-far from all the other equilibria of the game. This contradicts the well supported $(\epsilon, \Delta)$-approximation stability condition, as desired.

Thus case (b) occurs first, which implies that $p^*$ is highly concentrated. We clearly have $1 - \beta \le 8\Delta$; moreover, it is easy to show that the set $H$ has at most $S \log(1 + (8\Delta)^{-1})$ elements. The key idea is that since $1 - \beta \le 8\Delta$, we can now apply the sampling argument of [18] to $L$ with accuracy parameter $O(\epsilon/\Delta)$ and then union the result with $H$. Specifically, let us decompose $p^*$ as:

$$p^* = \beta p_H + (1 - \beta) p_L.$$

Applying the sampling argument of [18] to $p_L$, we have that by sampling a multiset $X$ of $S$ elements from $L = \mathrm{supp}(p_L)$, we are guaranteed that for any column $e_j$, we have:

$$|U_X^T C e_j - p_L^T C e_j| \le (\epsilon / 8\Delta),$$

where $U_X$ is the uniform distribution over $X$. This means that for $\tilde{p} = \beta p_H + (1 - \beta) U_X$, all columns $e_j$ satisfy:

$$|p^{*T} C e_j - \tilde{p}^T C e_j| \le \epsilon/2.$$

We have thus found the row portion of an $\epsilon$-equilibrium with support of size $S \log(1 + (8\Delta)^{-1})$ as desired.
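The final step, keeping the heavy entries and replacing the light part by the empirical distribution of $S$ samples, can be sketched as below. The function name `sparsify_light_part` is hypothetical, and the sketch only builds the sparse strategy; it does not verify the payoff guarantee, which holds with the probability given by the sampling argument.

```python
import random

def sparsify_light_part(p, H, S, rng):
    """Return beta * p_H + (1 - beta) * U_X for a multiset X of S samples from p_L."""
    heavy = set(H)
    light = [i for i in range(len(p)) if i not in heavy and p[i] > 0]
    beta = sum(p[i] for i in heavy)          # total mass of the heavy set
    out = [0.0] * len(p)
    for i in heavy:
        out[i] = p[i]                        # heavy entries are kept unchanged
    if light:
        # Draw the multiset X: S independent samples from the light part of p.
        draws = rng.choices(light, weights=[p[i] for i in light], k=S)
        for i in draws:
            out[i] += (1 - beta) / S         # each sample carries a 1/S share
    return out
```

The resulting strategy has at most $|H| + S$ nonzero entries, which is where the support bound in the theorem comes from.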



Corollary 1 Let us fix $\epsilon$ and $\Delta$, $0 \le \epsilon \le \Delta \le 1$. Let $G$ be a game with at most $n^{O((\Delta/\epsilon)^2)}$ Nash equilibria satisfying the well supported $(\epsilon, \Delta)$-approximation stability condition (or the $(\epsilon/2, \Delta)$-stability to perturbations condition).

(1) Given $G$ we can find a well-supported $\epsilon$-equilibrium $(p, q)$ of $G$ in time $n^{O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)}$.

(2) Given $G'$, an $\epsilon/6$-perturbation of $G$, we can find a well supported $\epsilon$-equilibrium $(p, q)$ of $G$ in time $n^{O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)}$.

In both cases, $(p, q)$ is $\Delta$-close to a Nash equilibrium $(p^*, q^*)$ of $G$.

Proof: (1) By Theorem 2 we can simply try all supports of size $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$ and for each of them write an LP to search for a well-supported $\epsilon$-Nash equilibrium.

(2) By Theorem 2, $G$ has a well-supported $\epsilon/3$-Nash equilibrium with support of size $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$. Since $G'$ is an $\epsilon/6$-perturbation of $G$, this is also a well-supported $2\epsilon/3$-Nash equilibrium of $G'$. Thus by trying all supports of size $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$ in $G'$ we can find a well-supported $2\epsilon/3$-Nash equilibrium of $G'$. Since $G$ is an $\epsilon/6$-perturbation of $G'$, this will be a well-supported $\epsilon$-Nash equilibrium of $G$. ■
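The search in this proof relies on recognizing a well-supported $\epsilon$-Nash equilibrium. A minimal sketch of that test is given below, assuming the standard definition (every pure strategy played with positive probability is within $\epsilon$ of a best response); the function names are hypothetical and the LP over a fixed support is not reproduced here.

```python
def row_payoffs(R, q):
    """e_i^T R q for every row i."""
    return [sum(R[i][j] * q[j] for j in range(len(q))) for i in range(len(R))]

def col_payoffs(C, p):
    """p^T C e_j for every column j."""
    return [sum(p[i] * C[i][j] for i in range(len(p))) for j in range(len(C[0]))]

def is_well_supported_eps_nash(R, C, p, q, eps):
    """True if every pure strategy in either support is an eps-best response."""
    u = row_payoffs(R, q)
    w = col_payoffs(C, p)
    rows_ok = all(u[i] >= max(u) - eps for i in range(len(p)) if p[i] > 0)
    cols_ok = all(w[j] >= max(w) - eps for j in range(len(q)) if q[j] > 0)
    return rows_ok and cols_ok
```

The two perturbation steps in part (2) correspond to loosening `eps` by twice the perturbation size each time: payoffs move by at most $\epsilon/6$, so payoff differences move by at most $\epsilon/3$.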



Corollary 1 improves by a factor $O(1/(\Delta^2 \log(1 + \Delta^{-1})))$ in the exponent over the bound of [18] for games satisfying this condition. The most interesting range of improvements happens when $\epsilon$ is a function of $n$ and $\Delta$ is a function of $\epsilon$; e.g., $\epsilon = 1/\sqrt{n}$, $\Delta = 10\epsilon$. In this case we obtain an improvement of $O(n/\log(n))$ in the exponent over the bound of [18].
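As a rough check of this example, the two exponents can be compared directly. The calculation below is a sketch that takes the exponent of [18] to be $O(\log n / \epsilon^2)$ and ignores constant factors.

\[
\underbrace{\frac{\log n}{\epsilon^2}}_{\text{[18]}} = n \log n,
\qquad
\underbrace{\left(\frac{\Delta}{\epsilon}\right)^2 \log\!\left(1 + \Delta^{-1}\right) \log n}_{\text{Corollary 1}} = 100 \, \log\!\left(1 + \frac{\sqrt{n}}{10}\right) \log n,
\qquad
\frac{n \log n}{100 \, \log\!\left(1 + \sqrt{n}/10\right) \log n} = \Theta\!\left(\frac{n}{\log n}\right).
\]

The ratio equals $1/(\Delta^2 \log(1 + \Delta^{-1}))$ evaluated at $\Delta = 10/\sqrt{n}$, matching the general factor stated above.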



The proof of Theorem 2 also implies an interesting structural result, namely that each Nash equilibrium of such a game is close to a pair of strategies of small support and, by the triangle inequality, the same will happen for any perturbation of $G$. Formally:

Theorem 3 Let us fix $\epsilon$ and $\Delta$, $0 \le \epsilon \le \Delta \le 1$. Consider a game $G$ with at most $n^{O((\Delta/\epsilon)^2)}$ Nash equilibria which satisfies the well supported $(\epsilon, \Delta)$-approximation stability condition (or the $(\epsilon/2, \Delta)$-stability to perturbations condition). Then it must be the case that:

(1) Any Nash equilibrium in $G$ is $8\Delta$-close to a pair of mixed strategies each with support of size at most $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$.

(2) For any game $G'$ within distance $\epsilon/2$ of $G$, any Nash equilibrium in $G'$ is $9\Delta$-close to a pair of mixed strategies each with support of size $O((\Delta/\epsilon)^2 \log(1 + \Delta^{-1}) \log n)$.

So far in Theorem 2 and Corollary 1 we have considered $\epsilon$ and $\Delta$ fixed. It is also interesting to consider games where the stability conditions hold uniformly for all $\epsilon$ small enough. We call such games uniformly stable games. Formally:

Definition 8 Consider $t \ge 1$. We say that a game is $t$-uniformly stable to perturbations (or $t$-uniformly well supported approximation stable) if there exists $\epsilon_0 = 1/\mathrm{poly}(n)$ such that for all $\epsilon \le \epsilon_0$, $G$ satisfies $(\epsilon, t\epsilon)$ stability to perturbations (or the well supported $(\epsilon, t\epsilon)$ approximation stability).

For games satisfying the $t$-uniform stability to perturbations condition with $t = O(\mathrm{poly}(\log(n)))$ we can find $1/\mathrm{poly}(n)$-approximate equilibria in $n^{\mathrm{poly}(\log(n))}$ time, and more generally $\epsilon$-approximate equilibria in $n^{\log(1/\epsilon)\,\mathrm{poly}(\log(n))}$ time, thus achieving an FQPTAS. This provides a dramatic improvement over the best worst-case bounds known.



Corollary 2 (1) Let $t = O(\mathrm{poly}(\log(n)))$. There is an FQPTAS to find approximate equilibria in games satisfying the $t$-uniform well supported approximation stability condition (or the $t$-uniform stability to perturbations condition) with at most $n^{O(t^2)}$ Nash equilibria.

(2) Games satisfying the $t$-uniform well supported approximation stability condition (or the $t$-uniform stability to perturbations condition) with at most $n^{O(t^2)}$ Nash equilibria have the property that for any $\Delta$ each equilibrium is $8\Delta$-close to a pair of mixed strategies each with support of size $O(t^2 \log(1 + \Delta^{-1}) \log n)$; moreover, for any $L_\infty$-perturbation $G'$ of magnitude $\Delta/t$ of such games, it must be the case that any Nash equilibrium in $G'$ is $9\Delta$-close to a pair of mixed strategies each with support of size $O(t^2 \log(1 + \Delta^{-1}) \log n)$.

Corollary 2 is especially interesting because the results of [8] prove that it is PPAD-hard to find $1/\mathrm{poly}(n)$-approximate equilibria in general bimatrix games. Our results show that under the (widely believed) assumption that PPAD is not contained in quasi-polynomial time |@], such uniformly stable games are inherently easier for computation of approximate equilibria than general bimatrix games. Moreover, variants of many games appearing commonly in experimental economics, including the public goods game and the identical interest game [14], satisfy this condition.



5 Converting the general case to the stable case 

In this section we show that computing an $\epsilon$-equilibrium in a game satisfying the strong $(\epsilon, O(\epsilon^{1/4}))$ approximation stability is as hard as computing an $O(\epsilon^{1/4})$-equilibrium in a general game. For our reduction, we show that any general game can be embedded into one having the strong $(\epsilon, O(\epsilon^{1/4}))$ approximation stability property such that an $\epsilon$-equilibrium in the new game yields an $O(\epsilon^{1/4})$-equilibrium in the original game. Since both notions of (strong) stability to perturbations and (strong) well supported approximation stability generalize the strong approximation stability condition, the main lower bound in this section (Theorem 4) applies to these notions as well.

We start by stating a useful lemma that shows the existence of a family of modified matching pennies games that are strong approximation stable games with certain properties that will be helpful in proving our main lower bound.

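Lemma 2 Assume that $\Delta \le 1/10$. Consider the games defined by the matrices:

$$R = \begin{pmatrix}
1 + \alpha_{1,1} & 1 + \alpha_{1,2} & \cdots & 1 + \alpha_{1,n} & 0 \\
1 + \alpha_{2,1} & 1 + \alpha_{2,2} & \cdots & 1 + \alpha_{2,n} & 0 \\
\vdots & \vdots & & \vdots & \vdots \\
1 + \alpha_{n,1} & 1 + \alpha_{n,2} & \cdots & 1 + \alpha_{n,n} & 0 \\
0 & 0 & \cdots & 0 & 2\Delta
\end{pmatrix},
\qquad
C = \begin{pmatrix}
\gamma_{1,1} & \gamma_{1,2} & \cdots & \gamma_{1,n} & 1 \\
\gamma_{2,1} & \gamma_{2,2} & \cdots & \gamma_{2,n} & 1 \\
\vdots & \vdots & & \vdots & \vdots \\
\gamma_{n,1} & \gamma_{n,2} & \cdots & \gamma_{n,n} & 1 \\
2\Delta & 2\Delta & \cdots & 2\Delta & 0
\end{pmatrix}$$

where $\alpha_{i,j} \in [-\Delta, 0]$ and $\gamma_{i,j} \in [0, \Delta]$ for all $i, j$. Each such game satisfies the strong $(\Delta^2, 4\Delta)$ approximation stability condition. Moreover if $(p, q)$ is a $\Delta^2$-Nash equilibrium, then we must have

$$\Delta/2 \le p_1 + \ldots + p_n \le 4\Delta \quad \text{and} \quad \Delta/2 \le q_1 + \ldots + q_n \le 4\Delta.$$

See Appendix B for a proof. We now present the main result of this section.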

Theorem 4 Computing an $\epsilon$-equilibrium in a game satisfying the strong $(\epsilon, 8\epsilon^{1/4})$ approximation stability condition is as hard as computing an $(8\epsilon)^{1/4}$-equilibrium in a general game.

Proof: The main idea is to construct a linear embedding of any given game into a larger game with one more strategy per player played with large probability, thereby compressing the incentives of the original game into a smaller scale. In particular, consider $\Delta = (8\epsilon)^{1/4}$ and consider a general game with payoff matrices $R$ and $C$. Let us construct a new game with the payoff matrix for the row player $R'$ defined as:

$$R' = \begin{pmatrix} 1_{n,n} - (\Delta/2) 1_{n,n} + (\Delta/2) R & 0_{n,1} \\ 0_{1,n} & 2\Delta \end{pmatrix}$$

and for the column player $C'$ defined as:

$$C' = \begin{pmatrix} (\Delta/2) 1_{n,n} + (\Delta/2) C & 1_{n,1} \\ 2\Delta \, 1_{1,n} & 0 \end{pmatrix}.$$

(For $s, r > 0$, the matrix $1_{s,r}$ is the $s \times r$ matrix with all entries set to 1 and the matrix $0_{s,r}$ is the $s \times r$ matrix with all entries set to 0.) By Lemma 2 the new game defined by $R'$ and $C'$ satisfies the strong $(\Delta^2, 4\Delta)$ approximation stability condition, which in turn implies satisfying $(\epsilon, 8\epsilon^{1/4})$ approximation stability (since $\epsilon \le \Delta^2$ and $4(8)^{1/4} \le 8$). We show next that any $\Delta^4/8$-equilibrium in this new game (defined by $R'$ and $C'$) induces a $\Delta$-equilibrium in the original game (defined by $R$ and $C$). Since $\Delta = (8\epsilon)^{1/4}$, this implies the desired result.

Let $(p, q)$ be a $\Delta^4/8$-equilibrium in the new game. By Lemma 2 (since $\Delta^4/8 \le \Delta^2$), $p$ must have $\beta\Delta$ probability mass in the first $n$ rows and $q$ must have $\alpha\Delta$ probability mass in the first $n$ columns, where $\alpha, \beta \in [1/2, 4]$. Let $p_f$, $q_f$ denote $p$ restricted to the first $n$ rows and $q$ restricted to the first $n$ columns. Let $\bar{p}_f = p_f / |p_f|$ and $\bar{q}_f = q_f / |q_f|$, where $|p_f| = \beta\Delta$ and $|q_f| = \alpha\Delta$. We show that $(\bar{p}_f, \bar{q}_f)$ is a $\Delta$-equilibrium in the original game defined by $R$ and $C$. We prove this by contradiction. Assume this is not the case. Assume first that the row player has a $\Delta$ incentive to deviate. There must exist $e_i$ such that:

$$e_i^T R \bar{q}_f > \bar{p}_f^T R \bar{q}_f + \Delta.$$

Multiplying both sides by $\alpha\beta\Delta^3/2$ and using the fact that $\alpha\beta \ge 1/4$ we get:

$$\beta\Delta \, e_i^T (\Delta/2) R q_f > p_f^T (\Delta/2) R q_f + \Delta^4/8. \qquad (2)$$

We clearly have $\beta\Delta \, e_i^T (1_{n,n} - (\Delta/2) 1_{n,n}) q_f = p_f^T (1_{n,n} - (\Delta/2) 1_{n,n}) q_f$, and by adding this quantity as well as $p_{n+1} (2\Delta) q_{n+1}$ to both sides of inequality (2) we get:

$$\beta\Delta \, e_i^T (1_{n,n} - (\Delta/2) 1_{n,n} + (\Delta/2) R) q_f + p_{n+1} (2\Delta) q_{n+1} > p_f^T (1_{n,n} - (\Delta/2) 1_{n,n} + (\Delta/2) R) q_f + p_{n+1} (2\Delta) q_{n+1} + \Delta^4/8,$$

which implies:

$$(\beta\Delta \, e_i + p_{n+1} e_{n+1})^T R' q > p^T R' q + \Delta^4/8.$$

Therefore there exists a deviation for the row player (namely moving all $\beta\Delta$ probability mass from rows $1, 2, \ldots, n$ onto row $i$), yielding a benefit of $\Delta^4/8$ to the row player. This contradicts the assumption that $(p, q)$ is a $\Delta^4/8$-equilibrium in the new game, as desired.

Assume now that the column player has a $\Delta$ incentive to deviate. There must exist $e_j$ such that:

$$\bar{p}_f^T C e_j > \bar{p}_f^T C \bar{q}_f + \Delta.$$

Multiplying both sides by $\alpha\beta\Delta^3/2$ and using the fact that $\alpha\beta \ge 1/4$ we get:

$$p_f^T (\Delta/2) C (\alpha\Delta \, e_j) > p_f^T (\Delta/2) C q_f + \Delta^4/8.$$

We have $p_f^T (\Delta/2) 1_{n,n} \, \alpha\Delta \, e_j = p_f^T (\Delta/2) 1_{n,n} \, q_f$, so:

$$p_f^T ((\Delta/2) C + (\Delta/2) 1_{n,n}) \, \alpha\Delta \, e_j > p_f^T ((\Delta/2) C + (\Delta/2) 1_{n,n}) \, q_f + \Delta^4/8. \qquad (3)$$

We also have $p_{n+1} (2\Delta, \ldots, 2\Delta) \, \alpha\Delta \, e_j = p_{n+1} (2\Delta, \ldots, 2\Delta) \, q_f$. By adding this quantity as well as the term $p_f^T (1, \ldots, 1)^T q_{n+1}$ to both sides of inequality (3) we get:

$$p^T C' (\alpha\Delta \, e_j + q_{n+1} e_{n+1}) > p^T C' q + \Delta^4/8.$$

Therefore there exists a deviation for the column player (namely moving all $\alpha\Delta$ probability mass from columns $1, 2, \ldots, n$ onto column $j$), yielding a benefit of $\Delta^4/8$ to the column player. This contradicts the assumption that $(p, q)$ is a $\Delta^4/8$-equilibrium in the new game, as desired. ■
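The embedding used in this proof is simple to write down explicitly. The sketch below builds $R'$ and $C'$ from an $n \times n$ game with payoffs in $[0, 1]$; the function name `embed_game` is hypothetical, and the row player's block is taken to be $1 - \Delta/2 + (\Delta/2) R$, the form used in the derivation above.

```python
def embed_game(R, C, eps):
    """Return (R', C', delta) for an n x n game with payoffs in [0, 1]."""
    n = len(R)
    delta = (8 * eps) ** 0.25
    half = delta / 2
    # Row player: block 1 - delta/2 + (delta/2) R, zero last column,
    # zero last row except for the 2*delta corner.
    R2 = [[1 - half + half * R[i][j] for j in range(n)] + [0.0] for i in range(n)]
    R2.append([0.0] * n + [2 * delta])
    # Column player: block delta/2 + (delta/2) C, all-ones last column,
    # last row equal to 2*delta with a zero corner.
    C2 = [[half + half * C[i][j] for j in range(n)] + [1.0] for i in range(n)]
    C2.append([2 * delta] * n + [0.0])
    return R2, C2, delta
```

With payoffs in $[0, 1]$ the upper-left block of $R'$ has the form $1 + \alpha_{i,j}$ with $\alpha_{i,j} \in [-\Delta/2, 0]$ and the upper-left block of $C'$ lies in $[\Delta/2, \Delta]$, so the embedded game is of the form required by Lemma 2 whenever $\Delta \le 1/10$.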

Theorem 4 implies that for any $\epsilon \le (1/8)(0.3393)^4$, an algorithm for finding an $\epsilon$-equilibrium in a game satisfying the strong $(\epsilon, 8\epsilon^{1/4})$ approximation stability condition would imply a better than currently known algorithm for finding approximate equilibria in general games (with an approximation factor of 0.3393).

6 Stability in constant-sum games 

Consider a game defined by $R$ and $C$. Let

$$\mathcal{P}^* = \{p : \exists q \text{ s.t. } (p, q) \text{ is a Nash equilibrium}\}$$

and

$$\mathcal{Q}^* = \{q : \exists p \text{ s.t. } (p, q) \text{ is a Nash equilibrium}\}.$$

We say that $p$ is $\Delta$-far from $\mathcal{P}^*$ if the minimum distance between $p$ and $p' \in \mathcal{P}^*$ is $\ge \Delta$. Let $v_R$ and $v_C$ be the unique values of the row and column player respectively in a Nash equilibrium [22]. Lemmas 3 and 4 below characterize constant-sum games satisfying approximation stability in terms of properties of the space of mixed strategies for the row player and column player separately. Theorem 5 gives a polynomial time algorithm for determining the approximately best parameters for the strong approximation stability property for a given game.

Lemma 3 If for any $p$ that is $\Delta$-far from $\mathcal{P}^*$ there exists $e_j$ such that $p^T R e_j < v_R - \alpha$ and for any $q$ that is $\Delta$-far from $\mathcal{Q}^*$ there exists $e_i$ such that $e_i^T C q < v_C - \alpha$, then the game satisfies the $(\alpha/2, \Delta)$ approximation stability.

Proof: We show that any $(p, q)$ that is $\Delta$-far from all Nash equilibria cannot be an $\alpha/2$-equilibrium. Consider $(p, q)$ that is $\Delta$-far from all Nash equilibria. Then either $p$ is $\Delta$-far from $\mathcal{P}^*$ or $q$ is $\Delta$-far from $\mathcal{Q}^*$. Assume WLOG that $p$ is $\Delta$-far from $\mathcal{P}^*$. We know that there exists $e_j$ such that $p^T R e_j < v_R - \alpha$. We show that $(p, q)$ cannot be an $\alpha/2$-Nash equilibrium. If $p^T R q < v_R - \alpha/2$, then this is not an $\alpha/2$-equilibrium since the row player could play its minimax optimal strategy and get $v_R$. On the other hand, if $p^T R q \ge v_R - \alpha/2$, then $p^T C q \le v_C + \alpha/2$, but we know that $p^T C e_j > v_C + \alpha$, so the column player would have an $\alpha/2$ incentive to deviate, as desired. ■

Lemma 4 If there exists $p$ that is $\Delta$-far from $\mathcal{P}^*$ such that $\min_j p^T R e_j \ge v_R - \alpha$ or if there exists $q$ that is $\Delta$-far from $\mathcal{Q}^*$ such that $\min_i e_i^T C q \ge v_C - \alpha$, then the game cannot be $(\alpha, \Delta)$ approximation stable. Moreover, if, in the former case, $\mathrm{supp}(p) \subseteq \mathrm{supp}(p^*)$ for some $p^* \in \mathcal{P}^*$, or if, in the latter case, $\mathrm{supp}(q) \subseteq \mathrm{supp}(q^*)$ for some $q^* \in \mathcal{Q}^*$, then the game cannot be well-supported $(\alpha, \Delta)$ approximation stable.

Proof: Assume that there exists $p$ that is $\Delta$-far from $\mathcal{P}^*$ such that $\min_j p^T R e_j \ge v_R - \alpha$. Let $p^* \in \mathcal{P}^*$ be such that $\mathrm{supp}(p) \subseteq \mathrm{supp}(p^*)$ if such $p^*$ exists, else let $p^* \in \mathcal{P}^*$ be arbitrary; let $(p^*, q^*)$ be an equilibrium that certifies that $p^*$ is in $\mathcal{P}^*$. We know that $p^{*T} R q^* = v_R$ and $p^{*T} C q^* = v_C$. Clearly, $(p, q^*)$ is $\Delta$-far from $(p^*, q^*)$. We show that $(p, q^*)$ is an $\alpha$-Nash equilibrium, i.e., neither player has more than an $\alpha$-incentive to deviate. We have $p^T R q^* \ge v_R - \alpha$ and $p'^T R q^* \le v_R$ for any $p'$ (since $q^*$ is minimax optimal), so the row player has at most an $\alpha$-incentive to deviate. We also have $p^T C q^* \ge v_C$ (since $q^*$ is minimax optimal) and the most the column player could get is $v_C + \alpha$ since $\min_j p^T R e_j \ge v_R - \alpha$, so $\max_j p^T C e_j \le v_C + \alpha$. ■

11 This follows from the well known interchangeability property of constant-sum games, meaning that given two Nash equilibrium points $(p_1, q_1)$ and $(p_2, q_2)$, the strategy pairs $(p_1, q_2)$ and $(p_2, q_1)$ are also Nash equilibria. To see this, note that if both $p$ and $q$ are close to $\mathcal{P}^*$ and $\mathcal{Q}^*$ respectively, then there exist Nash equilibria $(p_1, q_1)$, $(p_2, q_2)$ such that $d(p, p_1) \le \Delta$ and $d(q, q_2) \le \Delta$. By interchangeability, we get that $(p_1, q_2)$ is a Nash equilibrium and we also have $d((p_1, q_2), (p, q)) \le \Delta$.

If the game satisfies the strong approximation stability condition, then we can efficiently compute good approximations for the stability parameters. Specifically:

Theorem 5 Given any $0 < \alpha < 1$, we can use Algorithm 1 to whp determine $\Delta$ such that the game satisfies the $(\alpha/2, 2\Delta)$ strong approximation stability property, but not $(\alpha, \Delta/2)$ strong approximation stability. The running time is polynomial, $n^{O(1/\alpha)}$.

Proof: We first find a minimax optimal solution $(p^*, q^*)$ and then, in Step 2, a small support $\alpha$-Nash equilibrium $(p', q')$. In Step 3 we find $\Delta$ such that all $\alpha/2$-Nash equilibria are within distance $\Delta$ of $(p', q')$ and there exists an $\alpha$-Nash equilibrium at distance $\Delta$ from $(p', q')$. From the perspective of the row player, as shown in Lemma 3, if there exists $e_j$ such that $p^T R e_j < v_R - \alpha$, then $(p, q)$ cannot be an $\alpha/2$-Nash equilibrium for any $q$, so all $\alpha/2$-Nash equilibria must satisfy $p^T R e_j \ge v_R - \alpha$ for all $j$. As shown in Lemma 4, for any $p$ such that $p^T R e_j \ge v_R - \alpha$ for all $j$, we have that $(p, q^*)$ is an $\alpha$-Nash equilibrium. Similarly for the column player. So all $\alpha/2$-equilibria must be at distance at most $\Delta$ from $(p', q')$, where $\Delta$ is the output of Algorithm 1, and there exists an $\alpha$-Nash equilibrium that is at distance $\Delta$ from $(p', q')$. By the triangle inequality, we obtain that the game is $(\alpha/2, 2\Delta)$ stable and it is not $(\alpha, \Delta/2)$ stable. Note that the running time is polynomial since we perform steps (A), (B) at most $n^{O(1/\alpha)}$ times, so overall the running time is polynomial, $n^{O(1/\alpha)}$. ■

In order to determine the approximate parameters for the strong well supported approximation stability property for a given game, we can adapt Algorithm 1 as follows. Given any $0 < \alpha < 1$, we can use Algorithm 1 to whp determine $\Delta_h$. We then re-run Algorithm 1, but in the LP in (A) we add the constraint that $p_i = 0$ for all $i \notin \mathrm{supp}(p^*)$ (and similarly for $q$ in the LP (B)) to get a value $\Delta_l$. Then by Lemmas 3 and 4 we are guaranteed that the game satisfies the $(\alpha/2, 2\Delta_h)$, but not the $(\alpha, \Delta_l/2)$, strong well supported approximation stability property.

A Standard Facts

We start by stating the McDiarmid inequality (see iflill ) we use in our proofs: 

Theorem 6 Let $Y_1, \ldots, Y_n$ be independent random variables taking values in some set $A$, and assume that $t : A^n \to \mathbb{R}$ satisfies:

$$\sup_{y_1, \ldots, y_n \in A, \; \hat{y}_i \in A} |t(y_1, \ldots, y_n) - t(y_1, \ldots, y_{i-1}, \hat{y}_i, y_{i+1}, \ldots, y_n)| \le c_i,$$

for all $i$, $1 \le i \le n$. Then for all $\gamma > 0$ we have:

$$\Pr\{|t(Y_1, \ldots, Y_n) - E[t(Y_1, \ldots, Y_n)]| \ge \gamma\} \le 2e^{-2\gamma^2 / \sum_{i=1}^n c_i^2}.$$
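To see how this bound is used numerically, the sketch below evaluates the right-hand side for given bounded differences $c_i$. The function name `mcdiarmid_tail` is hypothetical; the example values in the note after the code mirror the use in the proof of Lemma 1 with a uniform light part.

```python
import math

def mcdiarmid_tail(gamma, c):
    """Upper bound 2 * exp(-2 * gamma^2 / sum_i c_i^2) on Pr{|t - E[t]| >= gamma}."""
    return 2.0 * math.exp(-2.0 * gamma ** 2 / sum(ci ** 2 for ci in c))
```

For instance, if the light part consists of $m$ equal entries, then $c_i = 6\Delta p_i/(1-\beta) = 6\Delta/m$, and with $\gamma = 2\Delta/5$ the bound becomes $2e^{-(2/225) m}$, independent of $\Delta$.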

We now state a well known fact showing any pair of strategies that is sufficiently close to a Nash equilibrium 
is a sufficiently good approximate Nash equilibrium. 

Claim 1 If $(p, q)$ is $\alpha$-close to a Nash equilibrium $(p^*, q^*)$ (i.e., if $d((p, q), (p^*, q^*)) \le \alpha$), then $(p, q)$ is a $3\alpha$-Nash equilibrium.



B Additional Proofs 

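Lemma 2 Assume that $\Delta \le 1/10$. Consider the games defined by the matrices:

$$R = \begin{pmatrix}
1 + \alpha_{1,1} & 1 + \alpha_{1,2} & \cdots & 1 + \alpha_{1,n} & 0 \\
1 + \alpha_{2,1} & 1 + \alpha_{2,2} & \cdots & 1 + \alpha_{2,n} & 0 \\
\vdots & \vdots & & \vdots & \vdots \\
1 + \alpha_{n,1} & 1 + \alpha_{n,2} & \cdots & 1 + \alpha_{n,n} & 0 \\
0 & 0 & \cdots & 0 & 2\Delta
\end{pmatrix},
\qquad
C = \begin{pmatrix}
\gamma_{1,1} & \gamma_{1,2} & \cdots & \gamma_{1,n} & 1 \\
\gamma_{2,1} & \gamma_{2,2} & \cdots & \gamma_{2,n} & 1 \\
\vdots & \vdots & & \vdots & \vdots \\
\gamma_{n,1} & \gamma_{n,2} & \cdots & \gamma_{n,n} & 1 \\
2\Delta & 2\Delta & \cdots & 2\Delta & 0
\end{pmatrix}$$

where $\alpha_{i,j} \in [-\Delta, 0]$ and $\gamma_{i,j} \in [0, \Delta]$ for all $i, j$. This game satisfies the strong $(\Delta^2, 4\Delta)$ approximation stability condition. Moreover if $(p, q)$ is a $\Delta^2$-Nash equilibrium, then we must have

$$\Delta/2 \le p_1 + \ldots + p_n \le 4\Delta \quad \text{and} \quad \Delta/2 \le q_1 + \ldots + q_n \le 4\Delta.$$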

Proof: First note that $e_{n+1}^T R q = 2\Delta q_{n+1}$ and

$$e_i^T R q = (1 + \alpha_{i,1}) q_1 + (1 + \alpha_{i,2}) q_2 + \ldots + (1 + \alpha_{i,n}) q_n \quad \text{for } 1 \le i \le n.$$

Also $p^T C e_{n+1} = p_1 + \ldots + p_n$ and

$$p^T C e_j = p_1 \gamma_{1,j} + \ldots + p_n \gamma_{n,j} + 2\Delta p_{n+1} \quad \text{for } 1 \le j \le n.$$

By a simple case analysis, one can show that any Nash equilibrium $(p, q)$ must have $0 < p_{n+1} < 1$ and $0 < q_{n+1} < 1$. (This is also implicit in our analysis of $\Delta^2$-Nash equilibria below.) This then implies that in any Nash equilibrium $(p, q)$ such that $p_i \ne 0$ we must have:

$$2\Delta q_{n+1} = (1 + \alpha_{i,1}) q_1 + (1 + \alpha_{i,2}) q_2 + \ldots + (1 + \alpha_{i,n}) q_n. \qquad (4)$$

Similarly, in any Nash equilibrium $(p, q)$ such that $q_j \ne 0$ we must have:

$$p_1 + \ldots + p_n = p_1 \gamma_{1,j} + \ldots + p_n \gamma_{n,j} + 2\Delta p_{n+1}. \qquad (5)$$

Identities (4) and (5) together with the fact that $\alpha_{i,j} \in [-\Delta, 0]$ and $\gamma_{i,j} \in [0, \Delta]$ for all $i, j$ imply that there must exist a Nash equilibrium $(p, q)$ satisfying:

$$\frac{2\Delta}{1 + 2\Delta} \le p_1 + \ldots + p_n \le \frac{2\Delta}{1 + \Delta} \quad \text{and} \quad \frac{2\Delta}{1 + 2\Delta} \le q_1 + \ldots + q_n \le \frac{2\Delta}{1 + \Delta}. \qquad (6)$$

To get the desired stability guarantee we now show that any $\Delta^2$-equilibrium must have $\Delta/2 \le p_1 + \ldots + p_n \le 4\Delta$ and $\Delta/2 \le q_1 + \ldots + q_n \le 4\Delta$. (This in turn implies that any $\Delta^2$-equilibrium must be at distance at most $4\Delta$ from a Nash equilibrium satisfying relation (6).) We prove this by contradiction. Consider an arbitrary $\Delta^2$-equilibrium $(p, q)$. We analyze a few cases.

Case 1: Suppose $p_{n+1} > 1 - \Delta/2$. Then the column player's payoff for column $n + 1$ is $p^T C e_{n+1} = \sum_{i=1}^n p_i < \Delta/2$. But the column player's payoff for a column $j \in \{1, \ldots, n\}$ is:

$$p^T C e_j = \sum_{i=1}^n \gamma_{i,j} p_i + 2\Delta p_{n+1} \ge 2\Delta(1 - \Delta/2).$$

If $q_{n+1} \ge 1/2$ then the column player has incentive to deviate at least:

$$p^T C e_j - p^T C q \ge (1/2)[2\Delta(1 - \Delta/2) - \Delta/2] \ge \Delta/2 > \Delta^2,$$

which cannot happen since $(p, q)$ is a $\Delta^2$-equilibrium. On the other hand, if $q_{n+1} < 1/2$, then the row player has a huge incentive to deviate. Specifically, the row player's payoff for row 1 is $e_1^T R q \ge (1/2)(1 - \Delta)$, but the row player's payoff for row $n + 1$ is $e_{n+1}^T R q < (1/2) 2\Delta = \Delta$. Thus in this case the row player has incentive to deviate at least:

$$e_1^T R q - p^T R q \ge (1 - \Delta/2)[(1/2)(1 - \Delta) - \Delta] \ge \Delta > \Delta^2,$$

which cannot happen since $(p, q)$ is a $\Delta^2$-equilibrium.

Case 2: Suppose $p_{n+1} < 1 - 4\Delta$. Then the column player's payoff for column $n + 1$ is $p^T C e_{n+1} = \sum_{i=1}^n p_i > 4\Delta$, whereas the column player's payoff for a column $j \in \{1, \ldots, n\}$ is:

$$p^T C e_j = p_1 \gamma_{1,j} + \ldots + p_n \gamma_{n,j} + 2\Delta p_{n+1} \le \Delta \sum_{i=1}^n p_i + 2\Delta p_{n+1} = \Delta(1 - p_{n+1}) + 2\Delta p_{n+1} \le 2\Delta.$$

So, if $q_{n+1} \le 1 - \Delta/2$, then the column player has incentive to deviate at least:

$$p^T C e_{n+1} - p^T C q > (\Delta/2)[4\Delta - 2\Delta] = \Delta^2,$$

a contradiction. On the other hand, if $q_{n+1} > 1 - \Delta/2$, then the row player's payoff for row $n + 1$ is $e_{n+1}^T R q > 2\Delta(1 - \Delta/2)$, but the row player's payoff for any row $i \in \{1, \ldots, n\}$ is:

$$e_i^T R q = \sum_{j=1}^n (1 + \alpha_{i,j}) q_j \le 1 - q_{n+1} < \Delta/2.$$

So, in this case, the row player has incentive to deviate at least:

$$e_{n+1}^T R q - p^T R q \ge 4\Delta[2\Delta(1 - \Delta/2) - \Delta/2] > \Delta^2,$$

which cannot happen since $(p, q)$ is a $\Delta^2$-equilibrium.

Case 3: Suppose $q_{n+1} > 1 - \Delta/2$. As in the second half of the case 2 analysis, we have that the row player's payoff for row $n + 1$ is $e_{n+1}^T R q > 2\Delta(1 - \Delta/2)$, but the row player's payoffs for rows $1, \ldots, n$ are $< \Delta/2$. So, if $p_{n+1} \le 1 - \Delta$ then the row player has incentive to deviate at least:

$$e_{n+1}^T R q - p^T R q \ge \Delta[2\Delta(1 - \Delta/2) - \Delta/2] > \Delta^2,$$

which cannot happen since $(p, q)$ is a $\Delta^2$-equilibrium. On the other hand, if $p_{n+1} > 1 - \Delta$, then the column player's payoff for column $n + 1$ is $p^T C e_{n+1} = \sum_{i=1}^n p_i < \Delta$, but the column player's payoff for any column $j \in \{1, \ldots, n\}$ is:

$$p^T C e_j = p_1 \gamma_{1,j} + \ldots + p_n \gamma_{n,j} + 2\Delta p_{n+1} \ge (1 - \Delta)(2\Delta).$$

So, the column player has incentive to deviate at least:

$$p^T C e_j - p^T C q \ge q_{n+1}[2\Delta(1 - \Delta) - \Delta] \ge \Delta/2 > \Delta^2,$$

which cannot happen since $(p, q)$ is a $\Delta^2$-equilibrium.

Case 4: Finally assume that $q_{n+1} < 1 - 4\Delta$. Then the row player's payoff for row $n + 1$ is $e_{n+1}^T R q < 2\Delta(1 - 4\Delta)$, but the row player's payoffs for rows $1, \ldots, n$ are $> (4\Delta)(1 - \Delta)$. So, if $p_{n+1} \ge 1/2$ then the row player has incentive to deviate to a row $i \in \{1, \ldots, n\}$ of at least:

$$e_i^T R q - p^T R q \ge (1/2)[4\Delta(1 - \Delta) - 2\Delta(1 - 4\Delta)] \ge \Delta/2 > \Delta^2,$$

which cannot happen since $(p, q)$ is a $\Delta^2$-equilibrium. Finally, if $p_{n+1} < 1/2 < 1 - 4\Delta$ we apply the analysis in case 2.

Thus any $\Delta^2$-equilibrium must have $\Delta/2 \le p_1 + \ldots + p_n \le 4\Delta$ and $\Delta/2 \le q_1 + \ldots + q_n \le 4\Delta$, as desired. This concludes the proof. ■

Lemma 5 Assume that the game $G$ satisfies the $(\epsilon, \Delta)$-approximation stability and that the union of all $\Delta$-balls around all Nash equilibria does not cover the whole space. Then we must have $3\Delta \ge \epsilon$.

Proof: Since the union of all $\Delta$-balls around all Nash equilibria does not cover the whole space, we must have a $(p, q)$ that is at distance exactly $\Delta$ from some fixed Nash equilibrium and that is $\Delta$-far from all the other Nash equilibria. By Claim 1 we also have that this is a $3\Delta$-Nash equilibrium. This then implies the desired result. ■



Lemma 6 Assume that the bimatrix game $G$ specified by $R$ and $C$ has a non-pure Nash equilibrium.

(a) If $G$ satisfies the strong well supported $(\epsilon, \Delta)$-approximation stability condition, then we must have $\Delta \ge \epsilon/4$.

(b) If $G$ satisfies the strong $(\epsilon, \Delta)$-stability to perturbations condition, then we must have $\Delta \ge \epsilon/8$.

Proof: Assume $G$ satisfies the strong well supported $(\epsilon, \Delta)$-approximation stability condition. By definition, there exists a Nash equilibrium $(p^*, q^*)$ such that any well supported $\epsilon$-equilibrium is $\Delta$-close to $(p^*, q^*)$. Let $(p, q)$ be an arbitrary non-pure Nash equilibrium of $G$ and assume WLOG that $p$ is a mixed strategy. Consider an $\alpha$ internal deviation of the row player, i.e., consider $p'$ with $\mathrm{supp}(p') \subseteq \mathrm{supp}(p)$ such that $d(p, p') = \alpha$. Since $\mathrm{supp}(p') \subseteq \mathrm{supp}(p)$ we have $p'^T R q = p^T R q$. Since $(p, q)$ is a Nash equilibrium we have $p^T C e_j = p^T C q = v_C$ for all $j \in \mathrm{supp}(q)$ and $p^T C e_j \le v_C$ for all $j \notin \mathrm{supp}(q)$. Since $d(p, p') = \alpha$ we have

$$|p'^T C e_j - p^T C e_j| \le |(p' - p)^T C e_j| \le \alpha,$$

for all $j$, so $p'^T C e_j \ge v_C - \alpha$ for all $j \in \mathrm{supp}(q)$ and $p'^T C e_j \le v_C + \alpha$ for all $j \notin \mathrm{supp}(q)$. Thus $(p', q)$ is a well supported $2\alpha$-Nash equilibrium. By construction, we have $d((p', q), (p, q)) = \alpha$. Since $d$ is a metric, by the triangle inequality, we get that at least one of the pairs $(p', q)$ and $(p, q)$ is at least $\alpha/2$ far from $(p^*, q^*)$; however they are both $2\alpha$ well supported Nash equilibria. This implies that we must have $\Delta \ge \epsilon/4$, as desired. By Theorem 1 we immediately get (b) as well. ■



C Examples 

To illustrate our notions of stability, we present two n by n games satisfying uniform stability to perturba- 
tions. 

Example 1 A classic game from experimental economics is the public goods game, which is defined as follows. We have two players and each can choose to play a number between 0 and $n - 1$ corresponding to an amount of money to contribute. If the Row player contributes $i$ dollars and the Column player contributes $j$ dollars, then each gets back $0.75(i + j)$. So the payoff to the Row player is $0.75(i + j) - i$ and the payoff to the Column player is $0.75(i + j) - j$, where $i \in \{0, 1, \ldots, n - 1\}$ and $j \in \{0, 1, \ldots, n - 1\}$. This has payoffs ranging up to $0.75(n - 1)$, so to scale to the range $[0, 1]$ as we do in our paper, we multiply all the payoffs by $1/n$. I.e., if the Row player plays $i$ and the Column player plays $j$ then the payoff to the Row player is $[0.75j - 0.25i]/n$ and the payoff to the Column player is $[0.75i - 0.25j]/n$.

First note that this game is $(\epsilon, 0)$ stable to perturbations for all $\epsilon < 1/(8n)$. To see this, note that without any
perturbation, for any $j$ and any $i \ge 1$ we have $e_0^T R e_j - e_i^T R e_j = 0.25i/n \ge 0.25/n$. That means that the
Row player prefers playing action $0$ compared to action $i$ by $0.25i/n \ge 0.25/n$. So, in a game $(R', C')$ that
is an $L_\infty$ $\epsilon$-perturbation of our game we get $e_0^T R' e_j - e_i^T R' e_j \ge 0.25/n - 2\epsilon > 0$. That means that in
the perturbed game, the Row player still prefers playing action $0$. This implies that the only equilibrium in
the perturbed game has the Row player playing action $0$, and similarly the Column player playing $0$, so the
only equilibrium is $(0, 0)$.
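
The dominance argument above can be checked numerically. The following is a minimal sketch, not part of the original analysis: the helper names (`public_goods`, `perturb`, `row_margin`), the choice $n = 6$, and the random perturbations are all illustrative assumptions. It builds the scaled payoff matrices with exact fractions and confirms that action 0 keeps a strictly positive margin for the Row player under sampled $L_\infty$ perturbations with $\epsilon < 1/(8n)$.

```python
import random
from fractions import Fraction

def public_goods(n):
    """Scaled public goods game: R[i][j] = (0.75j - 0.25i)/n, C[i][j] = (0.75i - 0.25j)/n."""
    R = [[Fraction(3 * j - i, 4 * n) for j in range(n)] for i in range(n)]
    C = [[Fraction(3 * i - j, 4 * n) for j in range(n)] for i in range(n)]
    return R, C

def perturb(M, eps, rng):
    """Shift each entry independently by an amount in [-eps, eps] (hypothetical helper)."""
    return [[x + Fraction(rng.uniform(-1.0, 1.0)) * eps for x in row] for row in M]

def row_margin(R):
    """Smallest gain of row action 0 over any other row action, over all columns."""
    n = len(R)
    return min(R[0][j] - R[i][j] for i in range(1, n) for j in range(n))

n = 6
R, C = public_goods(n)
eps = Fraction(1, 8 * n) - Fraction(1, 1000)  # some eps < 1/(8n)
rng = random.Random(0)
margins = [row_margin(perturb(R, eps, rng)) for _ in range(200)]

print(row_margin(R))      # 1/24, which is 0.25/n for n = 6
print(min(margins) > 0)   # True: action 0 remains strictly dominant for Row
```

Random sampling only illustrates the bound; the guarantee itself comes from the inequality $0.25/n - 2\epsilon > 0$, which holds for every perturbation of size at most $\epsilon$.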

We now claim that this game is not $(\epsilon, 0.99)$ stable for any $\epsilon > 1/(4n)$. To see this, consider adding $\epsilon$ to
$R[1, 0]$. I.e., if the Row player plays action 1 and the Column player plays action 0, then the payoff to the Row
player is $-0.25/n + \epsilon > 0$. Now, $(1, 0)$ is a Nash equilibrium since this payoff is strictly greater than $R[i, 0]$
for any $i \ne 1$. In particular, $R[0, 0] = 0$ and $R[i, 0] < 0$ for all $i \ge 2$. So, there is now a Nash equilibrium
(actually the unique Nash equilibrium) at variation distance 1 from the original Nash equilibrium.
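
As a quick sanity check of this claim, the sketch below perturbs the single entry $R[1, 0]$ and enumerates pure Nash equilibria. The function name `is_pure_nash` and the values $n = 6$, $\epsilon = 1/(4n) + 1/1000$ are illustrative assumptions. The code only searches pure profiles; uniqueness among mixed profiles follows from action 0 being strictly dominant for the Column player, which the code does not test.

```python
from fractions import Fraction

def is_pure_nash(R, C, i, j):
    """True if neither player gains by a unilateral pure deviation from (i, j)."""
    row_ok = all(R[i][j] >= R[k][j] for k in range(len(R)))
    col_ok = all(C[i][j] >= C[i][l] for l in range(len(R[0])))
    return row_ok and col_ok

n = 6
R = [[Fraction(3 * j - i, 4 * n) for j in range(n)] for i in range(n)]
C = [[Fraction(3 * i - j, 4 * n) for j in range(n)] for i in range(n)]

eps = Fraction(1, 4 * n) + Fraction(1, 1000)  # some eps > 1/(4n)
R2 = [row[:] for row in R]
R2[1][0] += eps                               # the single perturbed entry

pure = [(i, j) for i in range(n) for j in range(n) if is_pure_nash(R2, C, i, j)]
print(is_pure_nash(R, C, 0, 0))  # True: (0, 0) is an equilibrium of the original game
print(pure)                      # [(1, 0)]
```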

Example 2 We present here a variant of the identical interest game. Both players have $n$ available actions.
The first action is to stay home, and the other actions correspond to $n - 1$ different possible meeting locations.
If a player chooses action 1 (stay home), his payoff is $1/2$ no matter what the other player is doing. If the
player chooses to go out to a meeting location, his payoff is 1 if the other player is there as well and it is 0
otherwise. Formally, $R[1, j] = 1/2$ for all $j$, $R[i, i] = 1$ for all $i > 1$, and $R[i, j] = 0$ for $i > 1$, $j \ne i$; and
similarly $C[i, 1] = 1/2$ for all $i$, $C[j, j] = 1$ for $j > 1$, and $C[i, j] = 0$ for $j > 1$, $i \ne j$. We claim that this game is well
supported $(\epsilon, 2\epsilon)$-stable for all $\epsilon < 1/6$, so it is 2-uniformly stable.
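
For concreteness, the payoff matrices just defined can be written out as follows. This is a minimal sketch with assumed helper names (`meeting_game`, `row_payoffs`); it uses 0-based indices, so index 0 corresponds to action 1 (stay home) in the text. It also checks, on one arbitrary mixed strategy $q$, that staying home pays $1/2$ and going to location $i$ pays exactly the probability that the other player is there.

```python
from fractions import Fraction

def meeting_game(n):
    """Payoff matrices of the stay-home / meeting-location game (index 0 = stay home)."""
    half = Fraction(1, 2)
    R = [[half if i == 0 else Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    C = [[half if j == 0 else Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    return R, C

def row_payoffs(R, q):
    """Expected payoff of each pure row action against the mixed column strategy q."""
    return [sum(R[i][j] * q[j] for j in range(len(q))) for i in range(len(R))]

n = 4
R, C = meeting_game(n)
q = [Fraction(1, 10), Fraction(2, 5), Fraction(1, 2), Fraction(0)]  # arbitrary example

# Staying home pays 1/2; location i pays q[i].
print(row_payoffs(R, q) == [Fraction(1, 2)] + q[1:])  # True
```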

Note that $e_1^T R q = 1/2$ and $e_i^T R q = q_i$ for $i > 1$. Similarly, $p^T C e_1 = 1/2$ and $p^T C e_i = p_i$ for $i > 1$.
Note that if $(p, q)$ is a well supported $\epsilon$-Nash equilibrium and if $q_i < 1/2 - \epsilon$ for some $i > 1$ then both $p_i = 0$
and $q_i = 0$. This follows immediately since $e_1^T R q = 1/2$ and $e_i^T R q = q_i$ for $i > 1$, so $p_i$ must equal 0
on any action whose expected payoff is less than $1/2 - \epsilon$. Since $p_i = 0$, $q_i$ must equal 0 as well in order to be
well supported. Also note that if $(p, q)$ is a well supported $\epsilon$-Nash equilibrium and if $q_i > 1/2 + \epsilon$ for some $i > 1$
then both $p_i = 1$ and $q_i = 1$. If $q_i > 1/2 + \epsilon$ we have $e_i^T R q > 1/2 + \epsilon$, and since $e_j^T R q \le 1/2$ for $j \ne i$ we
must have $p_i = 1$. This in turn implies $q_i = 1$.

Similarly, we can show the same for the row player as well. These imply that the well supported $\epsilon$-Nash
equilibria that are not already Nash equilibria must satisfy: for any action $i > 1$, $i \in \mathrm{supp}(q)$, we have
$1/2 - \epsilon \le q_i \le 1/2 + \epsilon$ and $1/2 - \epsilon \le p_i \le 1/2 + \epsilon$. Similarly, for any action $i > 1$, $i \in \mathrm{supp}(p)$, we
have $1/2 - \epsilon \le q_i \le 1/2 + \epsilon$ and $1/2 - \epsilon \le p_i \le 1/2 + \epsilon$. We have two cases. The first one is if there is
exactly one action $i > 1$ in $\mathrm{supp}(q)$. In that case, $(p, q)$ has distance at most $\epsilon$ from the Nash equilibrium
$(\frac{1}{2} e_1 + \frac{1}{2} e_i, \frac{1}{2} e_1 + \frac{1}{2} e_i)$. The second one is if there are two such actions $i, j > 1$ in $\mathrm{supp}(q)$. In that
case, $(p, q)$ has distance at most $2\epsilon$ from the Nash equilibrium $(\frac{1}{2} e_i + \frac{1}{2} e_j, \frac{1}{2} e_i + \frac{1}{2} e_j)$, as desired.
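
The two reference equilibria named in this case analysis can be verified directly from the payoff identities $e_1^T R q = 1/2$, $e_i^T R q = q_i$ (and their analogues for the Column player). The sketch below is an illustration under assumed names (`is_nash`, `mix`) and an arbitrary size $n = 5$; list index 0 stands for action 1 (stay home).

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def is_nash(p, q):
    """Exact Nash check using payoffs (1/2, q_2, ..., q_n) for Row and
    (1/2, p_2, ..., p_n) for Column; list index 0 is action 1 (stay home)."""
    row = [HALF] + list(q[1:])
    col = [HALF] + list(p[1:])
    row_ok = all(row[i] == max(row) for i in range(len(p)) if p[i] > 0)
    col_ok = all(col[j] == max(col) for j in range(len(q)) if q[j] > 0)
    return row_ok and col_ok

def mix(n, a, b):
    """Strategy putting weight 1/2 on each of the 0-based actions a and b."""
    s = [Fraction(0)] * n
    s[a] += HALF
    s[b] += HALF
    return s

n = 5
home_and_out = mix(n, 0, 2)  # (1/2) e_1 + (1/2) e_3 in the text's numbering
two_places = mix(n, 1, 3)    # (1/2) e_2 + (1/2) e_4

print(is_nash(home_and_out, home_and_out))  # True
print(is_nash(two_places, two_places))      # True
print(is_nash(mix(n, 1, 3), mix(n, 2, 4)))  # False: the players never meet
```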
Algorithm 1 Determining the strong stability parameters of a constant sum game.
Input: $R$, $C$, parameter $\alpha$.

1. Solve for minimax optimal $(p^*, q^*)$.

2. Apply the sampling procedure in [18] from $(p^*, q^*)$ to get $(p', q')$ with support of size
$O((\log n)/\alpha^2)$ that is an $\alpha$-Nash. Set $\Delta = 0$.

3. Find $\Delta$ as follows:

(A) For each partition of the support of $p'$ into $\mathrm{supp}_+$ and $\mathrm{supp}_-$ do:

i. Solve the following LP:

\[ \max\; v = \sum_{i \in \mathrm{supp}_+} (p_i - p'_i) + \sum_{i \in \mathrm{supp}_-} (p'_i - p_i) + \sum_{i \in \{1,2,\ldots,n\} \setminus \mathrm{supp}(p')} p_i \]

s.t. $p_i \ge p'_i$ for all $i \in \mathrm{supp}_+$
$p_i \le p'_i$ for all $i \in \mathrm{supp}_-$
$p^T R e_j \ge v_R - \alpha$ for all $j$

ii. If $\Delta$ is smaller than $v$, the value of the previous LP, reset $\Delta$ to be $v$.

(B) For each partition of the support of $q'$ into $\mathrm{supp}_+$ and $\mathrm{supp}_-$ do:

i. Solve the following LP:

\[ \max\; v = \sum_{i \in \mathrm{supp}_+} (q_i - q'_i) + \sum_{i \in \mathrm{supp}_-} (q'_i - q_i) + \sum_{i \in \{1,2,\ldots,n\} \setminus \mathrm{supp}(q')} q_i \]

s.t. $q_i \ge q'_i$ for all $i \in \mathrm{supp}_+$
$q_i \le q'_i$ for all $i \in \mathrm{supp}_-$
$e_j^T C q \ge v_C - \alpha$ for all $j$

ii. If $\Delta$ is smaller than $v$, the value of the previous LP, reset $\Delta$ to be $v$.
Output: Radius $\Delta$.
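
The non-LP parts of Algorithm 1 can be sketched in a few lines. The code below is an illustration only, with assumed helper names (`sample_strategy`, `sign_partitions`, `l1_objective`): it shows a sampling step in the spirit of step 2 (draw $k$ actions i.i.d. and use the empirical distribution), the enumeration of sign partitions from step 3, and the LP objective evaluated at a fixed candidate. It does not solve the LPs themselves, which would require an LP solver, and it does not compute the minimax strategies of step 1.

```python
import itertools
import random
from collections import Counter
from fractions import Fraction

def sample_strategy(p, k, rng):
    """Draw k actions i.i.d. from p and return the empirical distribution."""
    draws = rng.choices(range(len(p)), weights=[float(x) for x in p], k=k)
    counts = Counter(draws)
    return [Fraction(counts[i], k) for i in range(len(p))]

def sign_partitions(support):
    """Yield every split of `support` into (supp_plus, supp_minus)."""
    support = list(support)
    for mask in itertools.product([True, False], repeat=len(support)):
        plus = [i for i, m in zip(support, mask) if m]
        minus = [i for i, m in zip(support, mask) if not m]
        yield plus, minus

def l1_objective(p, p_prime, plus, minus):
    """Objective of the LP in step 3(A)i evaluated at a candidate p."""
    outside = [i for i in range(len(p)) if p_prime[i] == 0]
    return (sum(p[i] - p_prime[i] for i in plus)
            + sum(p_prime[i] - p[i] for i in minus)
            + sum(p[i] for i in outside))

rng = random.Random(0)
p_star = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]  # stand-in for a minimax strategy
p_prime = sample_strategy(p_star, 8, rng)
support = [i for i, x in enumerate(p_prime) if x > 0]
print(sum(p_prime) == 1)                                         # True
print(len(list(sign_partitions(support))) == 2 ** len(support))  # True

p0 = [Fraction(1, 2), Fraction(1, 2), Fraction(0)]
p1 = [Fraction(1, 4), Fraction(1, 2), Fraction(1, 4)]
print(l1_objective(p1, p0, plus=[1], minus=[0]))                 # 1/2
```

When the candidate satisfies the sign constraints of the chosen partition, the objective equals $\sum_i |p_i - p'_i|$, which is why enumerating partitions lets each subproblem be written as a linear program. A support of size $s$ yields $2^s$ partitions, so the support bound from step 2 controls the number of LPs solved.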